CONTRIBUTIONS  TO  STATISTICAL  TECHNIQUES  FOR  TWO 
AND  THREE  DIMENSIONAL  MEASUREMENT  PROBLEMS 


By 
James  Robert  Lackritz 


A  DISSERTATION  PRESENTED  TO  THE  GRADUATE  COUNCIL  OF 
THE  UNIVERSITY  OF  FLORIDA 
IN  FARTIAL  FULFILLMENT  OF  THE  REQUIREMENTS  FOR  THE 
DEGREE  OF  DOCTOR  OF  PHILOSOPHY 


UNIVERSITY  OF  FLORIDA 
1977 


To  my  family  and  all  other  non-believers 


ACKNOWLEDGEMENTS 

There  are  many  people  to  whom  I  owe  my  sincere  thanks 
for  their  help  and  encouragement  during  my  graduate  years 
at  the  University  of  Florida. 

First  and  foremost,  I  will  always  be  indebted  to 
Dr.  Richard  Scheaffer  for  his  inspiration  and  unbounded 
patience  in  helping  me  in  my  quest  for  my  doctoral  degree. 
He  will  always  be  remembered,  not  just  for  his  statistical 
accomplishments,  but  as  well  for  his  great  personal  qual- 
ities as  a  human  being. 

I  also  wish  to  express  my  deepest  gratitude  to 
Dr.  William  Mendenhall  for  his  helpful  guidance  during  my 
graduate  years. 

I  would  like  to  thank  the  individual  members  of  my 
committee  for  their  help  and  interest  in  my  studies. 

Finally,  I  am  grateful  to  Betty  Rovira,  for  her 
patience  and  help  in  typing  the  final  draft. 


TABLE  OF  CONTENTS 

Page 

ACKNOWLEDGEMENTS ,  lil 

ABSTRACT vii 

CHAPTER 

1  INTRODUCTION  1 

1.1  History  of  Quantitative  Microscopy  .  .     1 

1.2  Models  for  Random  Division  of  Space   .     2 
1-3    Applications i) 

2  BASIC  STEREOLOGICAL  RELATIONSHIPS   6 

2.1  The  Mosaic 6 

2.2  Basic  Stereological  Results   7 

2.3  The  Spatial  Covariance  Function   ...  11 
2.^4    Poisson  Lines  Model 16 

2.5  Meijering  Cell  Model 17 

2.6  Boolean  Scheme \Q 

3  LINEAL  SAMPLING   21 

3-1    Introduction 21 

3-2    Lineal  Sampling 22 

3-3    Model  Development   23 

3-1!    Asymptotic  Distribution  of  the 

Estimator  L.//C   26 

3-5    Basic  Properties  of  Segment  Counts  .  .  28 

3.6    Asymptotic  Distribution  of  M.//L  ...  3A4 

3-7    Joint  Distribution  of  M//L 37 

3-8    Asymptotic  Variance-Covariance  Matrix 

of  M//L 39 

3.9    Variance  Approximations l\2 


iv 


TABLE  OF  CONTENTS  -  Continued 

Page 
CHAPTER 

3.10   The  Results  of  Hilliard  and  Cahn  ...    kS 

3-11   Poisson  Lines  Model l\q 

3.12   Meijering  Cell  Model 50 

3-13   Tests  on  the  Mosaic  Using  Segment 

Counts c>2 

3.1*1   Spatial  Covariance  Function  Approach 
to  Using  Lineal  Sampling:   Poisson 
Process cij 

3.15   Boolean  Scheme 55 

4     AREAL  SAMPLING  59 

4.1  Introduction 59 

4.2  Asymptotic  Distribution  of  A//K      ...  60 

4.3  Estimation  of  the  Variance  Components.  62 

4.4  Poisson  Process 63 

4.5  Meijering  Cell  Model 65 

4.6  Comparisons  with  the  Results  of 

Hilliard  and  Cahn 65 

4.7  Spatial  Covariance  Function  Approach  .  67 
5     POINT  SAMPLING  g9 

5-1    Introduction 59 

5.2    Random  Points  on  the  Plane 69 

5-3    Systematic  Points  on  a  Line 70 

5.4    Estimation  of  the  Variance 73 

5-5    Covariance  Between  Colors   75 

5.6    Asymptotic  Distribution  of  the 

Estimates 76 

5-7    Extensions  of  Point  Sampling  85 

5-8    Systematic  Points  on  Random  Lines   .  .  88 

5-9    Comparisons  with  Hilliard  and  Cahn  .  .  89 

6     MULTISTAGE  SAMPLING   gi 

6.1    Introduction gi 


TABLE  OF  CONTENTS  -  Continued 

Page 
CHAPTER 

6.2  Lineal  Subsampling  91 

6.3  Point  Subsampling   93 

7     NUMERICAL  RESULTS   96 

7.1  Introduction 96 

7.2  Results:   Table  I 97 

7.3  Results:   Table  II 98 

7.4  Results:   Table  III 100 

7-5    Results:   Table  IV 100 

7.6    Conclusions 101 

BIBLIOGRAPHY  107 

BIOGRAPHICAL  SKETCH   108 


Abstract  of  Dissertation  Presented  to  the 
Graduate  Council  of  the  University  of  Florida  in  Partial 
Fulfillment  of  the  Requirements  for  the 
Degree  of  Doctor  of  Philosophy 


CONTRIBUTIONS  TO  STATISTICAL  TECHNIQUES  FOR  TWO 
AND  THREE  DIMENSIONAL  MEASUREMENT  PROBLEMS 

By 

James  Robert  Lackritz 

August  1977 

Chairman:   Dr.  Richard  L.  Scheaffer 
Major  Department:   Statistics 

Estimation  of  structural  properties  of  two  and  three- 
dimensional  figures  can  be  achieved  through  areal ,  lineal, 
and  point  sampling  techniques.   Such  properties  might 
include  areal  and  volume  fractions,  surface  area  per  unit 
volume,  and  boundary  length  per  unit  area.   Unbiased  esti- 
mators for  the  individual  parameters  are  well-known,  and 
have  been  used  for  quite  some  time  in  the  research  associ- 
ated with  this  field.   Additional  properties  of  the  indi- 
vidual estimators  are  developed,  including  consistency  and 
asymptotic  normality,  and  practical  variance  approximations 
are  given.   These  properties  are  derived  under  the  assump- 
tion of  a  general  cell  model   for   the  construction  of  the 
population  figure.   Once  the  general  results  are  obtained, 
special  cases  of  more  specific  models  are  discussed. 


The  variance  approximations  for  the  estimators  are 
obtained  by  making  use  of  the  available  sampling  informa- 
tion, so  that  the  usual  inferential  techniques  of  interval 
estimation  and  hypothesis  testing  can  be  employed.   Often, 
the  assumption  of  a  special  model  yields  a  simplified  vari- 
ance expression  for  the  individual  estimators. 

A    second  approach  to  areal  and  volume-fraction  analy- 
sis makes  use  of  the  spatial  covariance  function  of  the 
underlying  model  to  generate  a  variance  expression  for  the 
areal  or  volume-fraction  estimator.   The  results  from  this 
approach  are  tied  in  with  the  results  obtained  from  the 
assumption  of  the  general  cell  model. 

The  final  chapter  assesses  the  use  of  numerical  data 
to  check  the  variance  expressions  derived  in  previous  chap- 
ters.  Four  different  figures  were  sampled,  and  the  numer- 
ical results  allow  for  the  comparisons  of  the  individual 
models . 


CHAPTER  1 

INTRODUCTION 

1 • 1   History  of  Quantitative  Microscopy 

The  problem  of  estimating  structural  properties  of 
two  and  three-dimensional  figures  has  been  studied  for 
over  100  years  by  engineers,  mathematicians,  and  stereo- 
logists.   DeHoff  and  Rhines  [51  review  the  history  of 
Quantitative  Microscopy  in  summarizing  the  basic  results 
that  have  been  developed.   The  first  work  in  this  field 
dates  back  to  1848,  when  Delesse  proved  mathematically 
that  the  area  occupied  by  each  constituent  of  a  cross- 
section  of  a  rock  is  proportional  to  its  volume  In  the  mass 
of  the  rock.   The  first  work  in  lineal  analysis  was  intro- 
duced by  Rosiwal  in  1898,  while  in  1930,  Thompson  looked 
at  measurements  from  point  counts.   Starting  in  1945,  more 
exact  statistical  relationships  for  two-dimensional  sections 
were  discovered  in  independent  studies,  first  by  Saltykov, 
and  then  by  several  scientists  in  the  United  States, 
including  Rhines.   The  new  ideas  showed  the  direct  propor- 
tionality between  the  number  oV    intersections  of  a  line 
with  a  phase  on  the  figure  and  the  boundary  length  of  the 
phase  of  a  two-dimensional  feature  or  the  surface  area  of 
the  phase  on  a  three-dimensional  feature.   Throughout  the 


the  years,  investigators  have  looked  at  the  basic  measure- 
ments that  can  be  obtained  through  lineal,  areal,  and 
point  sampling  techniques,  with  a  view  towards  developing 
estimators  for  the  basic  properties  of  the  figure.   Some 
important  recent  results  in  point  sampling  have  been  out- 
lined by  Koop  [9],  who  studied  21  different  methods  of 
point  sampling  on  the  plane  for  purposes  of  areal  fraction 
estimation.   Under  minimal  assumptions,  unbiased  estimators 
have  been  generated  by  the  researchers  for  such  parameters 
as  areal  and  volume-fractions,  feature  boundary  length  in 
two  dimensions,  and  surface  area  in  three  dimensions.   The 
culmination  of  the  individual  results  is  a  mathematical 
n-dimensional  theorem  of  Miles  [15  "J,  from  which  many  of  the 
individual  unbiased  estimators  can  be  derived.   These 
results  will  be  considered  in  more  detail  in  Chapter  2. 


1 . 2   Models  for  Random  Division  of  Space 

The  work  of  Miles  [15],  generalizing  the  earlier  indi- 
vidual results  into  one  relationship,  yields  unbiased  esti- 
mators for  the  two  and  three-dimensional  properties  with 
minimal  assumptions  on  the  underlying  figure  or  the  sampling 
scheme  from  which  the  measurements  are  taken.   Many  theoret- 
ical models  for  the  random  division  of  space  have  been 
developed  to  try  to  account  for  the  construction  of  the 
mosaic.   Several  of  these  models  will  be  discussed  through- 
out the  next  four  chapters. 


One  of  the  simplest  and  most  theoretically  investi- 
gated models  is  the  Poisson  Lines  Model,  whose  properties 
have  been  developed  and  discussed  by  many  scientists, 
including  Miles  [133  and  Switzer  [24].   This  model  allows 
for  the  division  of  space  into  a  system  of  convex  polygons 
by  placing  a  series  of  random  lines  in  a  coordinate  system, 
where  the  lines  serve  as  boundaries  of  the  individual  cells. 
Miles  ["13J  developed  many  Important  properties  of  the  cells, 
including  the  behavior  of  a  line  transect  that  is  randomly 
placed  inside  the  cell  configuration.   Switzer  [24]  showed 
that  the  alternation  among  phases  along  a  randomly  placed 
line  transect  in  the  process  is  Markovian  for  a  finite- 
phase  Poisson  Process. 

The  Meijering  Cell  Model,  the  result  of  which  is 
sometimes  known  as  Voronoi  polygons,  is  discussed  in  detail 
by  Gilbert  [6],  and  again  is  further  expanded  on  by  Miles 
[l1*].   This  model  arises  from  the  random  placement  of 
points  to  expand  outward  in  all  directions  at  a  constant 
rate  until  meeting  with  another  cell,  upon  which  the 
boundary  is  defined.   This  model,  which  can  be  paralleled 
to  a  germination-growth  model,  thus  results  in  a  set  of 
non-overlapping  convex  polygons. 

The  third  model  to  be  used  will  be  the  Boolean  Scheme, 
discussed  by  Serra  [22].   Like  the  Meijering  Cell  Model, 
this  model  starts  with  a  random  point  process,  but  then 
random  sets  are  placed  around. the  points,  allowing  for  set 
overlap,  unlike  either  of  the  previous  two  models.   Serra 


also  looks  at  a  different  approach  to  volume-fraction  esti- 
mation, which  uses  the  spatial  covariance  function  of  the 
underlying  model,  if  a  particular  model  can  be  assumed. 

The  assumption  of  an  underlying  model  for  the  mosaic 
gives  a  probability  structure  to  the  mosaic  under  consid- 
eration.  Chapters  3  through  6  serve  to  tie  the  underlying 
probability  structure  to  properties  of  the  estimators,  so 
that  additional  properties  can  be  developed.   Among  the 
properties  discussed  will  be  consistency,  asymptotic  distri- 
butions, and  variance  approximations  for  the  estimators. 
Hilliard  and  Cahn  [8]  developed  variance  approximations 
for  areal-f raction  estimators  under  certain  assumptions  on 
the  mosaic,  and  their  results  will  be  shown  to  be  a  special 
case  of  the  variance  approximations  obtained  from  more 
general  models. 


1 . 3   Applications 

The  major  results  discussed  by  DeHoff  and  Rhines  [5] 
and  generalized  by  Miles  [15]  have  diverse  applications 
into  a  wide  range  of  fields,  including  materials  science, 
ecology,  physiology,  and  biology.   DeHoff  and  Rhines  [5] 
discuss  how  a  structure  of  spherical  copper  powder  may  be 
examined  under  a  microscope  to  estimate  the  fraction  of  the 
powder  occupied  by  voids  between  the  interparticle  welds. 
An  ecologist  may  be  able  to  use  an  areal  photograph  of  a 
piece  of  land  to  estimate  the  boundary  length  for  a  partic- 
ular river  or  lake,  and  additional  ecological  examples  are 


mentioned  in  detail  by  Pielou  [16]  and  Matern  [12]. 
Weihel  [25]  presents  a  section  on  the  utilization  of 
electron  micrographs  of  liver  cells  for  useful  information 
on  cell  sizes  and  orientation,  especially  relevant  in 
cancer  research  and  drug-related  experiments.   Lang  and 
Melhuish  [10]  looked  at  estimating  lengths  and  diameters 
of  plant  roots  by  passing  a  plane  underground  through  the 
root  system. 

The  applications  are  numerous,  especially  for  the 
fields  associated  with  the  physical,  agricultural,  and 
ecological  sciences.   Many  different  useful  measurements 
may  be  obtained  through  the  various  sampling  techniques, 
not  only  for  generating  unbiased  estimators  for  the  param- 
eters, but  also  for  obtaining  variance  approximations  for 
inferential  purposes. 


CHAPTER  2 
BASIC  STEREOLOGICAL  RELATIONSHIPS 

2.1   The  Mosaic 

Throughout  the  remaining  chapters,  the  term  mosaic 
will  refer  to  a  two-dimensional  structure  composed  of 
two  or  more  distinctive  features  called  phases.   The 
general  topic  under  research  involves  the  estimation  of 
various  structural  properties  of  the  mosaic  through  geo- 
metric sampling  procedures,  and  the  development  of  addi- 
tional properties  of  these  estimators,  such  as  consistency, 
asymptotic  distribution,  and  variance  approximations.   The 
details  of  interest  on  the  mosaic  might  include: 

(1)  The  areal  fractions  of  the  mosaic  occupied  by  differ- 
ent phases,  which  in  turn  may  estimate  the  volume- 
fractions  if  the  mosaic  represents  a  planar  sectioning 
of  a  three-dimensional  figure. 

(2)  Boundary  length  of  certain  two-dimensional  features, 
which  In  turn,  may  estimate  the  corresponding  surface 
area  of  three-dimensional  features. 

(3)  Possible  feature  orientation  or  homogeneity  on  the 
mosaic . 

The  information  needed  to  generate  estimates  for  the  param- 
eters of  the  mosaic  can  be  obtained  through  many  different 


sampling  procedures,  three  of  which  will  be  discussed  in 
detail  in  the  succeeding  chapters.   They  are: 

(1)  Randomly  selecting  one  or  more  areal  sections  from 
the  mosaic. 

(2)  Randomly  locating  one  or  more  linear  transects  within 
the  boundaries  of  the  mosaic. 

(3)  Randomly  locating  a  set  of  points  on  the  mosaic. 
Estimation  of  structural  properties  of  the  mosaic  through 
areal,  lineal,  and  point  sampling  has  a  rich  history  in 
quantitative  microscopy  and  other  disciplines,  as  pointed 
out  in  Chapter  1.  The  remaining  sections  of  this  chapter 
will  review  some  of  the  known  results,  which  will  be  used 
to  build  upon  in  the  remaining  chapters. 


2 .2   Basic  Stereological  Results 

Throughout  the  years,  researchers  in  the  field  of 
quantitative  microscopy  have  strived  to  develop  unbiased 
estimators  for  the  parameters  of  the  mosaic.   The  popu- 
lation quantities  of  interest  include  volumes  and  surface 
areas  of  three-dimensional  phases  and  areas  and  boundary 
lengths  of  two-dimensional  phases.   The  sampling  devices 
used  are  planes,  lines,  and  points,  so  that  the  sample 
measurements  arise  from  the  intersection  between  the 
sampling  device  and  the  phase  of  the  mosaic.   The  measure- 
ments from  an  areal  section  include  the  total  area  of  the 

t  h 
sample,  A,  and  the  area  occupied  by  the  j    phase,  denoted 

by  A.  .   The  lineal  transect  measurements  of  interest  are  L, 


the  length  of  the  transect,  L.,    the  total  length  of  the 
intersection  of  the  line  with  the  j    phase,  and  I.,  the 

number  of  intersections  of  the  transect  with  the  boundaries 

„  , ,    .  th         „ .  . 
oi  the  j    phase.   Similarly  for  point  sampling,  the  number 

of  points,  n,  and  the  point  count  for  phase  j,  N.,  will  be 

important  measurements  to  the  experimenter. 

A  set  of  unbiased  estimators  for  the  parameters  of  the 

mosaic  that  were  developed  individually  by  the  various 

scientists  over  the  years  were  generalized  by  Miles  [15] 

into  a  single  result  based  on  an  exercise  in  n-dimensional 

geometrical  probability.   Miles'  theorem  states: 

To  estimate  a  t-dimensional  quantity  of  a 
d-dimensional  mosaic  using  an  s-dimensional  sampling 
device  yielding  ( s  + t  -  d  ) -dimensional  measurements, 
for  s,  t,  and  d  non-negative  integers  such  that  t<d 
and  s  <  d,  let 

Dd  =  the  measure  of  the  population  figure 

D^  =  the  measure  of  the  intersection  of 
the  sampling  device  with  the  popu- 
lation figure 

Xfc  =  the  measure  of  the  feature  of  interest 
on  the  population  figure 

Xs+t-d  =  the  measure  of  tne  intersection  of  the 
sampling  device  with  the  feature  of 
interest . 

(Here,  measure  is  defined  to  be  volume  in  three 
dimensions,  area  in  two  dimensions,  length  in  one 
dimension,  and  point  counts  for  zero  dimensions.) 


Suppose  the  sampling  device  is  randomly 
located  in  the  space  and  that  D  is  not  zero. 
Then 


where 


E(Xs+t-d'  xt 

d 


YUT)' 


(2.2.1) 


kd(s,t)  = 


s  +  1 


t  +  1 


+  t-d  +  l|fd  +  l 


2 


2 


(2.2.2 


Looking  at  the  individual  parameters  of  interest,  the 
estimators  obtained  from  this  single  relationship  will  be 
the  same  as  those  individually  developed,  as  discussed  in 
the  history  of  quantitative  microscopy  by  DeHoff  and 
Rhines  [51. 

Suppose  the  parameter  of  interest  is  the  volume- 
fraction  occupied  by  a  particular  phase.   Let  V  denote  the 

volume  of  the  three-dimensional  figure,  and  V.  the  volume 

J 

of  phase  j  on  the  figure.   If  an  areal  sample  is  used, 
then  the  intersection  of  the  planar  sample  with  phase  j 
will  be  the  phase  j  areas  on  the  plane.   With  d=t=3  and 
s-2,  k3(2,3)=l,  so  that  E(A,  )/E(A)  =  V./V.   Similarly, 
for  a  lineal  transect,  where  d  =  t  =  l,  and  for  point 
sampling,  where  d  =  t  =  0,  one  has 


E  ( N  ) 
E(n) 


E(L.)    E(A.)    V. 
J   =     J   _  _J_ 
E ( L )     E(A)     V 


(2.2.3) 
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Also,  for  fixed  areal  fraction,  A. /A,  when  d = t  = 2 
lineal  and  point  sampling  will  yield 


E(N . )    E(L . ) 

E(n)  =  E(L) 


(2.2.i|) 


Tt  should  be  noted  from  (2.2.3)  and  (2.2.4)  that  in 
estimating  a  volume,  areal,  or  lineal  fraction   with  a 
lower-dimensioned  sample-fraction,  unbiased  estimators 
result  only  when  the  intersection  between  the  sampling 
device  and  the  population  figure  of  interest  is  constant. 
For  example,  if  an  areal  sample  is  used,  where  A. /A  esti- 
mates V./V,  and  A  is  a  non-degenerate  random  variable, 
then  E(A./A)  ¥■  E(A.  )/E(A),  so  that  A  . /A  will  be  a  biased 

J  J  J 

estimator  of  V./V.   In  practice,  this  problem  can  be 
eliminated  by  predetermination  of  the  sample  "size,"  and 
locating  the  sampling  scheme  entirely  within  the  borders 
of  the  population  figure. 

A  second  parameter  of  interest  on  a  three-dimensional 
figure  is  the  surface  area,  S.,  for  a  particular  phase  j. 
If  the  figure  is  sampled  by  a  planar  section  of  area  A, 
the  intersection  of  the  plane  with  the  phase  will  be  the 
boundary  length,  B.,  of  the  phase  on  the  plane.   Therefore, 
with  d  =  3,  t  =  s  =  2, 


r(f)r(f)  v 
k3(2'2)  "  r ( l ) r ( 2 )"  "  ii 


1] 


so  that 


V 


7T  E(A) 


(2.2.5) 


Similarly,  if  the  boundary  length  per  unit  area  for  a 
phase  j  is  to  be  estimated  from  a  lineal  transect,  the 
sample  measurements  of  interest  are  the  number  of  inter- 
sections, I.,  the  transect  makes  with  the  phase  j  boundary 
With  d  =  2,  s  =  t  =  1, 


k2(l,l) 


=  r(pr(i) 
r(|)r(|) 


so  that 


E(I.) 


2  ~ElhT   "   A 


(2.2.6) 


implying  that  twice  the  number  of  intersections  per  unit 
length  of  the  transect  with  the  phase  boundary  also  serves 
as  an  unbiased  estimator  for  the  surface  area  per  unit 
volume.   Again,  as  mentioned  previously,  it  will  be  essen- 
tial that  the  denominator  of  the  estimator  be  fixed  to 
assure  that  the  estimator  is  unbiased  for  the  parameter. 
For  a  treatment  of  the  biased  estimation  case,  see  Davy 
and  Miles  [4]. 


2 . 3   The  Spatial  Covariance  Function 

A  large  portion  of  Chapters  3,  *J ,  and  5  will  be 
devoted  to  getting  variance  approximations  for  the  afore- 
mentioned estimators  of  Section  2.2.   The  notion  of  a 


12 

spatial  covariance  function  will  be  employed  in  some  of 
this  work. 

Consider  a  two-dimensional  mosaic,  G,  of  fixed  over- 
all size  and  any  set  6,  of  random  size  and  location  within 
the  borders  of  G . 

Define 

f(x)  =  1   if  a  point  xeG  falls 
in  set  3 

=  0  otherwise 

so  that  P(x e  3)  = ECf (x)].   The  expectation  of  f(-)  can  be 

found  by  first  conditioning  on  3  being  of  fixed  size,  and 

then  taking  the  expectation,  £„(•),  over  the  sizes  of  3. 

p 

Thus, 

E[f(x)]  -  EQ{E[f(x)  I  3  fixed  size]} 


1 
3  I  mes  G 


E  (mes  3) 

mes  G 


f (x)dx 


where  mes(')  denotes  volume  in  three  dimensions,  area  in 
two  dimensions,  length  in  one  dimension,  and  point  counts 
in  zero  dimensions.   For  the  remainder  of  this  section,  let 
p  =  E„ (mes  3) /mes  G . 

For  two  points,  x  and  x+h,  a  distance  h  apart  and  at 
an  angle  a  from  a    fixed  orientation,  let 

C(h)  =  E[f (x)f(x+h)]  =  P[x,x+he3] 

=  E0{E[f (x)f (x+h)  I  3  fixed  size]} 
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3  mes  G 


f(x)f(x+h)dx 


G 


E  (mes  3  n 


-h 


mes  G       ' 

where  3_h  represents  the  set  obtained  by  translating  all 
points  in  (3  by  a  distance  h  in  the  direction  opposite  to  a. 
Note  that 

(i)  C(0)  =  p 

(ii)  C(h)  =  C(-h) 

(iii)  C(0)  >  C(h) 

(iv)  C(co)  =  p2. 

The  spatial  covariance  function,  cov(h),  is  defined 


by 


cov(h)  =  E[f(x)f(x+h)]  -  E[f (x)]E[  f (x  +  h)] 


C(h)  -p*. 


Furthermore,  if  3  is  a  two-dimensional  feature  defined  on 
the  mosaic,  an  important  result  involving  the  covariance 
function  is  that  the  expected  boundary  length  of  3  per  unit 


area  is  equal  to  -it  -rr-  cov(h) 
dh 


-tt  C  (0)  . 


h=0 


If  p  is  to  be  estimated,  let  the  set  S  denote  the 
sampling  device  on  the  mosaic,  where  S  might  represent  a 
subarea,  one  or  more  line  transects,  or  a  set  of  points 
dropped  on  the  mosaic. 

Let 


r 


f  (  x  )  g  (  x  )  d  x  , 


in 

where  g(x)  is  some  weight  function  defined  on  the  samplinj 
scheme.   In  particular,  g(x)  could  be  a  simple  function, 
defined  by 


S(X)  =  m^Tl3   lf  xe 


=  0 


for  which  case 


Now. 


where 


1 


g   mes  S 


E(Ig)  =  E{[  j   f(x)g(x)dx]c} 
G 


otherwise 


f (x)dx 


,2 


=  EC  [  f(x)g(x)dx  f  f(y)g(y)dy] 
G  G 

=  E[  J  j  f(x)f(y)g(x)g(y)dx  dy] 


G  G 


=  EM   j  f(x)f (x+h)g(x)g(x+h)dx  dh] 
'G  JG 


i   |  E[f(x)f(x+h)]g(x)g(x+h)dx  dh 
JG  JG 

I   C(h)  j  g(x)g(x+h)dx  dh 


G       'G 

C(h)G(h)dh  , 


0(h)  =  j   g(x)g(x+h)dx 

G 
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so  that 


V(Ig)  =  E(I2)-E2(Ig) 


G 


C(h)G(h)dh-  E[  [  f(x)g(x)dx]E[  f  f(y)g(y)dy] 
G  JG 

=  {  C(h)G(h)dh-|  j  p2g(x)g(x  +  h)dx  dh 

(2.3.1 


cov(h)G(h)dh  . 


Thus,  knowledge  of  the  covarlance  function  for  a  particular 
structure  can  result  in  a  variance  expression  for  the  area 
occupied  by  the  random  set  6.   Note  that  when 


g  ( x  )  = 


I 


1 

mes   S 

X    6    S 

0 

otherwi  se 

and  the  covarlance  function  is  stationary  and  isotropic, 
then  for  two  random  points  x  and  y  located  on  S 


V(I  )  = 


1 


(mes  S)' 


2      cov(  |y  -  x |  )dx  dy 


S  S 


l     r°° 

n  I  cov(h)b(h)dh  , 

R   .d^   J 


(mes  S) 


0 


where  b(h)  is  the  probability  density  function  of  h=  |y-x 
With  the  knowledge  of  the  spatial  covarlance  function 
being  necessary  to  obtain  a  variance  expression  for  the 
areal-fraction  estimator,  a  model  for  the  construction  of 
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the  mosaic  would  be  very  useful.   The  three  models  intro- 
duced in  Chapter  1  will  now  be  discussed  in  greater  detail 
so  that  they  may  be  used  in  the  development  of  additional 
properties  of  the  estimators. 

2 . H      Poisson  Lines  Model 

As  mentioned  in  Chapter  1,  the  Poisson  Lines  Model 
is  one  of  the  most  common  and  widely  investigated  models 
for  the  random  division  of  space.   As  defined  by  Switzer 
[24],  this  model  allows  for  the  random  division  of  space 
by  a  set  of  random  lines  constructed  as  follows: 

Upon  location  of  an  origin,  "0,"  at  the  center  of 

the  region,  select  n  pairs  of  points  {(r   6  )• 

i '    i 

i=l,2,...}    such    that     n     has    a    Poisson   distri- 
bution  with   parameter    t,    and    G,    is    uniform   on    the 
interval    ( 0 ,  tt  )  .       For    each   pair    of   points,    fit    the 
line 


xcose.  +  y  s  in  6  .  -r.  =0 

i  ii 


This  constitutes  the  realization  of  a  Poisson  Process  with 
intensity  parameter  t/tt.   The  realization  of  this  process 
leaves  the  area  with  a  number  of  intersecting  random  lines, 
resulting  in  a  set  of  convex  polygons.   Properties  of  this 
model  have  been  discussed  and  developed  by  several  mathe- 
maticians, including  Switzer  [24]  and  Miles  [13].   Miles 
introduced  some  key  results  for  areal  fraction  estimation: 


17 

(1)  The  points  at  which  a  line  transect  intersects  the 
random  lines  form  a  Poisson  Process  of  density 

X   =  2t/tt. 

(2)  The  distribution  of  the  areas  of  the  individual  cells 
has  mean 


M  = 


2.14.1) 


and   variance 


2      ? 
iCAl.    -2) 

— Tj 
2t 


(2.^.2) 


2  .  5   Meijering  Cell  Model 

The  construction  of  the  Meijering  Cell  Model,  as 
discussed  by  Gilbert  [6]  and  Miles  [ll],  produces  cells  of 
convex  polygons  from  an  underlying  Poisson  Point  Process 
of  intensity  x.   Each  point  expands  outward  at  a  constant 
rate  until  it  meets  with  another  cell,  or  equivalently ,  the 
cell  assigned  to  a  point,  y, ,  on  the  process  contains  all 
of  the  points  that  are  closer  to  y,  than  to  any  other 
point  y   (j  ?  i)    of  the  process.   For  a  two-dimensional 
model,  Gilbert  [6]  showed  that,  with  A.  denoting  the  area 
of  the  cell  generated  by  y  , 


and 


E(A.  )  =  1.018  t" 


V(A.)  =  .228  t  7 


(2.5.1) 


(2.5-2) 


2 . 6   Boolean  Scheme 

As  developed  by  Serra  [22],  the  Boolean  Scheme  is  a 
model  which  allows  for  an  overlap  among  cells. 

Let  y1>y2J---  be  tne  realization  of  a  planar  Poisson 

Point  Process  of  intensity  t.   Let  B    ,&,...    be  a  collection 

of  independently  and  identically  distributed  (i.i.d.)  random 

sets,  with  each  &      implanted  at  a  corresponding  y  .   The  set 

6  =  u  &1    is  called  the  realization  of  the  Boolean  Scheme, 

1 
which  will  be  known  as  phase  one.   The  remainder  of  the 

planar  mosaic  is  known  as  phase  two.  Serra  notes  that  the 

Boolean  Scheme  "represents  one  of  the  first  steps  in 

modeling,  when  one  admits  negligible  interaction  between 

the  particles  C  6±  J . "   This  kind  of  model  might  be  suggested 

for  use  in  crystalline  growth  and  land  studies. 

For  estimation  purposes,  suppose  the  resulting  mosaic 
is  sampled  by  S,  a  fixed  set  located  at  random  inside  the 
the  mosaic,  G,  noting  that  S  may  represent  a  subarea  of 
the  mosaic,  one  or  more  lineal  transects,  or  a  set  of  points 
developed  on  the  mosaic. 

Suppose  8.  is  located  at  x.   Then 


Therefore , 


6 ,  ©  S  =  { x  :  6  .  n  S  ji  <$> }  . 

J  J 


E  mes  (6.  © S) 

P  ( S  n  6  .  )  = J 

J         mes  G 


Defining  N  (S)  to  be  the. number  of  6.  sets  that  inter- 
sect with  S,  it  can  be  shown  that  N  (S)  has  a  Poisson 


1Q 


distribution  with  parameter  t  Ernes  (6©S).   For  any  point 
x  e  G ,  define 

1     if  x  e  6 
0     otherwise, 


k(x)  - 


and 


K(h)  =  J  k(x)k(x+h)d 


x  =  mes  ( 6  n  3 


-h;  * 


If  S-{x,x+h},  the  set  of  two  points  differing  by  the 
vector  h,  then 


mes  ($®S)    =  2  mes  6  -mes  ( 6  n  6_,  ) 
=  2K(0)  -  K(h)  . 


Therefore , 


C2(h)  =  P(x,x+b  e  3C) 


PEN  (S)  =  0] 


exp[-x{2K(0)  -  K(h) }] 


Defining 


q  =  C„(0)  =  P(x  c  6C)  = 


-tK(0) 


C0(h)  -   q2eTK(h) 


Also,  with 


P(x,x+h  e  B°)  =  P(x  e  3C)  -  P(x  e  6C  ,  x  +  h  e 


then 


P(x  c  6  ,  x+h  eg)  =  q  -  q  e 


2  xK(h) 


so  that 


C.  (h)  -  P(x,x+h  e   3)  =  1  -  2q  +q2eTK(h) 
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Therefore , 


cov(h)  =  C1(h)  -  p2  =  C2(h)  -  q2 

2  TK(h)    2 
=  q  e      -  q 


(2.6.1 


The  estimation  of  p  and  q  can  now  be  achieved  through  the 
use  of  the  covariance  function  technique  discussed  in 
Section  2.3. 

With  models  available  for  the  construction  of  the 
mosaic,  the  following  chapters  will  serve  to  not  only 
further  develop  the  properties  of  Miles'  estimators,  but 
also  to  integrate  these  results  with  the  available  models 
for  a  total  culmination  of  results  available  under  varying 
conditions  and  assumptions  on  the  mosaic. 


CHAPTER  3 
LINEAL  SAMPLING 

3  •  1   Introduction 

Consider  a  two-dimensional  mosaic  composed  of  two  or 
more  phases.   This  mosaic  could  arise  as  a  cross  section 
of  a  three-dimensional  figure  or  as  a  subsection  of  a 
larger  planar  figure.   The  purpose  of  this  chapter  is  to 
investigate  properties  of  estimators  of  planar  features 
such  as  areas  and  boundary  lengths,  in  the  case  of  sampling 
by  line  transects.   An  areal  photograph  of  a  section  of 
land  may  provide  an  environmentalist  with  estimates  for  the 
fraction  of  the  land  occupied  by  different  "phases,"  such 
as  trees  or  water  reservoirs,  along  with  possible  additional 
information  such  as  a  boundary  length  or  the  length  of  a 
perimeter-   around  a  lake.   A  metallurgist  may  be  interested 
in  estimating  the  volume-fraction  of  the  void  in  a  piece  of 
metal,  while  a  medical  technologist  might  examine  a  cross- 
section  of  a  liver  to  estimate  the  fraction  of  the  wall 
occupied  by  a  certain  type  of  cell. 

The  two  properties  of  the  mosaic  to  be  studied  in 
detail  are : 

(i)   The  areal  fraction  for  each  of  the  k  phases,  which 

may  in  turn  estimate  the  corresponding  volume-fractions 
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of  the  phases,  if  the  mosaic  arises  from  a  three- 
dimensional  figure. 

(ii)   The  boundary  length  for  a  given  feature,  which  in 
turn  is  related  to  the  surface  area  of  a  corre- 
sponding three-dimensional  feature. 
Estimation  of  these  properties  will  be  studied  through 

lineal,  areal,  and  point  sampling  techniques,  the  first  of 

which  is  studied  in  this  chapter. 


3 • 2   Lineal  Sampling 

This  section  discusses  the  sampling  procedure  of 
randomly  locating  one  or  more  transects  of  fixed  length 
entirely  within  the  borders  of  the  mosaic,  from  which  at 
least  two  sets  of  measurements  will  be  obtained.   The  first 
will  be  the  fractions  of  the  transect  passing  through  each 
of  the  k  phases,  which  will  be  used  to  estimate  the  corr- 
sponding  areal  fractions  for  each  of  the  k  phases.   Another 
measurement  of  interest  will  be  the  segment  counts  on  the 
transect  for  each  of  the  k-phases,  which  will  be  useful 
not  only  in  estimating  the  boundary  length  for  each  of  the 
phases  but  also  will  contribute  to  approximating  the  vari- 
ance of  the  areal-fraction  estimators.   The  segment  counts 
also  will  be  used  as  a  basis  of  a  test  for  preferred 
orientation  for  features  on  the  mosaic. 
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3 • 3   Model  Development 

The  first  step  of  the  estimation  process  is  the  devel- 
opment of  some  asymptotic  properties  of  the  areal-fraction 
estimators.   This  will  be  done  through  the  assumption  of  a 
model  for  the  construction  of  the  mosaic.   The  assumed 
model  considers  the  mosaic  as  a  union  of  a  set  of  non-over- 
lapping cells  with  cell  sizes  having  finite  second  moments. 

Each  cell  is  independently  assigned  to  phase  j  with  proba- 

k 
bility  p  >  0  for  j  =  1,2,. . . ,k  and   £  p  = 1 .   Thus,  a  feature 


j  =  l 


may  consist  of  one  or  more  adjoining  cells  of  the  same  phase. 
A  sampling  transect  of  length  L  consists  of  one  or  more  seg- 
ments, each  segment  being  composed  of  a  random  number  of 
intersections  with  the  original  cells.   Numerous  phases  may 
appear  on  an  individual  transect,  but  there  is  no  guarantee 
that  all  k  phases  will  show  up  on  the  transect. 

Recall  from  Section  2.h    that  both  the  Poisson  Lines 
Model  and  the  Meijering  Cell  Model  result  in  cells  with 
finite  second  moments,  and  hence,  could  result  in  a  mosaic 
as  discussed  above.   In  practice,  cells  may  be  distin- 
guishable in  some  cases,  such  as  in  observing  the  cross- 
section  of  the  wall  of  the  liver.   However,  in  most  cases 
where  such  a  model  may  be  employed,  the  individual  cells 
may  not  be  distinguishable.   Also,  many  mosaics  may  not 
even  have  an  underlying  cell  structure,  especially  when  a 
large  continuous  feature  such  as  a  lake  or  river  is  studied, 
but  the  cell  model  will  still.be  assumed  in  order  to  develop 
asymptotic  properties  of  the  estimators.   The  three  specific 
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models  for  cell  development  from  Chapter  2  will  be  discussed 
later  in  more  detail,  but  first  the  results  will  be  derived 
in  terms  of  a  general  cell  model.   A  few  terms  will  now  be 
defined . 

Let  L  equal  the  length  of  the  sample  transect  randomly 

located  on  the  mosaic,  and  let  X.  denote  the  intercept 

th 
length  of  the  i    cell  cut  by  the  transect.   It  is  assumed 

that  the  cell  intercepts  cut  by  the  transect  have  lengths 
that  are  independently  and  identically  distributed  with 
finite  mean  and  variance,  u  and  o2  respectively.   (This  is 
indeed  true  for  the  Poisson  Lines  Model.) 
Let 

th 


ij 


1    if  the  i    cell  cut  by  the 
transect  belongs  to  phase  j 

0    otherwise. 


and  let 


N   =  the  number  of  intersections  between 

the  transect  of  length  L  and  the  cell 
boundaries . 


Generally,  the  value  for  N   will  not  be  obtainable  from 

the  data,  unless  the  individual  cells  are  distinguishable. 

Estimates  for  the  expected  value  of  N  ,  denoted  by  E(N  ), 

will  be  discussed  later  in  this  chapter. 

Defining    L.    by 
J 


L  .    =       )'    <j>  .  .  X  . 
J         i  =  i    XJ     x    ' 


(3.3-1) 
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Lj  will  denote  the  total  length  of  the  intercepts  of  phase 

j  on  the  transect  of  length  L.   Note  that  {$   X. }  are  i.i.d. 

with  mean  p.u  and  variance  p  .  ( 1  -  p  .  )y 2  + p  .  02 . 
"  J      J       J 

The  cell  intercepts  on  the  transects  can  be  viewed  as 
a  realization  of  a  renewal  process  by  considering  the 
intersection  points  as  the  times  of  occurrences  of  different 
events  (phase  assignment).   A  standard  result  of  renewal 
theory  is  that  the  average  number  of  events  per  unit  time 
converges  to  the  inverse  of  the  average  length  of  time 
between  events,  or  equivalently , 


E(N 
L 


(3-3.2) 


as  L -»■«>,  assuming  u  >  0. 

In  looking  at  the  limit  of  E(L,/L),  Wald's  Theorem 
C23J  can  be  used  since 


Therefore , 


E(*i.iV  =  <v 


lim  E(L  /L)  =  p.  , 


L-*c 


(3.3-3) 


implying  that  L./L  is  an  asymptotically  unbiased  estimator 
of  P.- •   Also,  by  the  Weak  Law  of  Large  Numbers  and  the  fact 
that 

*    P  1 

N  /L  -t>  i  , 


L./L  is  a  consistent  estimator  for  p.  as  L  ■*■  °°. 
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The  underlying  mosaic  is  considered  to  be  infinite  in 
size,  so  that  a  sampling  transect  can  intersect  a  large 
number  of  cells.   This  leads  to  an  additional  consider- 
ation on  the  mosaic.   Since  the  cells  have  equal  expected 
cell  size,  the  total  areal  fraction,  P.,  for  the  jth  phase 
on  the  mosaic,  will  be  approximately  equal  to  p .  .   Thus, 
Lj/L  can  be  thought  of  as  a  consistent  and  unbiased  esti- 
mator of  F .  . 
J 

3.4   Asymptotic  Distribution  of  the  Estimators  L.//L 

Define  the  vectors  L  and  p  by 


-'  =  (L1  L2  "'  Lk.) 
P'  =  (Pjl  P2  *••  Pk) 


The  next  step  in  the  estimation • procedure  is  to  derive  the 
asymptotic  distribution  of  /pCL),  which  will  allow  for  the 
usual  inferential  techniques  of  hypothesis  testing  and 
interval  estimation.   Point  estimation  has  already  accounted 
for  j-L  being  a  consistent  and  asymptotically  unbiased 
estimator  of  p_.   The  theory  behind  cumulative  processes 
yields  the  theorems  which  give  the  joint  asymptotic  distri- 
bution of  T^(h-p).   To  follow  the  definition  of  a  cumu- 
li 
lative  process  as  stated  by  Smith  [23],   J"  d> .  .  X .  is  defined 

1  =  1  lJ  x 
to  be  a  cumulative  process  if: 

(i)   For  fixed  j,  ^.X..  is  a  sequence  of  i.i.d.  random 
variables  for  1  =  1,2,... 
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(ii)  _l    *ijXi  is5  with  probability  one,  of  bounded  vari- 
ation over  every  finite  interval  on  the  line. 
The  first  condition  holds  by  assumptions  already  stated. 
The  second  holds  since  N   is  finite  with  probability  one 
on  any  finite  interval,  and  <j> .  .  X .  is  non-negative  and 
finite . 

The  asymptotic  distribution  of  7HL  -  p_)  now  can  be 
derived  as  a  result  of  Theorem  10  of  Smith  [231: 
n 


>.,X .)  are  k  cumulative 


If  J^iA  *i2x.  • 

processes  and  o2  <  *>,  then  the  vector  /y7L(L  -  p_) 
will  converge  in  distribution  as  L  ->  °°  to  a  multi- 
variate normal  random  variable  with  zero  mean 


vector  and  variance  matrix  B,  where  b  ,  ,  the  st 
element  of  B  is  given  by 


th 


bst  -  cov(((..sX.  -psx.  ,  4>.tX.  -PtX._).    (3.1.1) 


To  obtain  the  individual  terms  of  the  asymptotic 
variance-covariance  matrix, 


b  .  .  =  E[(d>.  .  -  p,  )  X  1 


=  Ef(*1(j  -Pj)  ]E(xp 


nee    X^    and    <J> .  .    are    independent.       Therefore 


bjj    =    Pj(3  -  p.  )(o?  +11 


(3.1.2 


For  s/t,  the  covariance  term, 


bst  =  E[(*is-Ps)(*it-Pt)Xi] 

=  E[(*ls-Ps)(*.t-Pt)]E(x2) 


=  -PsPt(a2  +  u  )  . 


(3.^.3 


Therefore,  as  L  gets  large,  the  variance  of  L./L  can  be 
approximated  from  the  above  asymptotic  variance.   This 
term,  which  will  be  denoted  as  v..,  can  be  written  as 


v .  . 
JJ 


_  pj(i  -Pj)(°2  +K  )   p.O  -p.)(i  +^j) 


Lu 


E(N*) 


(  3  •  4  .1| 


The  corresponding  covariance  term  can  now  be  approximated 
by 


st 


-P  P<_(1  +  ^r) 

s^t ;    u2 

E(N  ) 


(3.4.5) 


for  s/t.   Now,  the  regular  inferential  procedures  of 
hypothesis  testing  can  be  employed  on  the  fractions  p  ,. 
..,Pk,  once  estimates  can  be  made  for  u,  c2,  and  E(N  ). 


3- 5   Basic  Properties  of  Segment  Counts 

With  the  establishment  of  the  variance-covariance 
approximations  for  the  lineal  estimates,  the  next  step 
is  to  try  to  estimate  the  modeling  parameters  of  (3.4.4) 
and  (3.4.5).   Note  that  in  using  L./L  to  estimate  p.,  u 
and  a2  are  parameters  of  the  mosaic,  and  will  have  to  be 
estimated  to  get  a  reasonable ■ approximation  for  v..  and 
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vst'   Additional  information  that  can  be  gathered  from  the 
sampling  transect  may  be  useful  for  this  purpose.   It  is 
assumed  that  the  individual  X.'s  cannot  be  measured. 
Instead,  the  transect  appears  as  a  series  of  segments  of 
different  phases.   Each  phase  j  segment  is  composed  of  a 
random  number  of  cell  intercepts,  that  random  number  having 
a  geometric  distribution  with  parameter  p..   The  segment 
counts  for  each  phase  not  only  provide  useful  information 
m  estimating  the  V  matrix,  but  also  allow   for  boundary 
length  estimation  of  individual  phases,  along  with  a  test 
for  orientation  and  homogeneity  of  features  on  the  mosaic. 
Define 


ij 


1 


st 


if  the  (i-1)    cell  intercepted  by  the 
transect  belongs  to  phase  j  and  the  i-th 
cell  intercepted  is  not  a  phase  j  cell 


0    otherwise 


and  let  M  =  £  f    for  j  =l,2,...,k.   M,  will  represent 

J   1=2  ±J  J 

the  number  of  Individual  segments  of  the  j    phase  on  the 

transect.   For  a  mosaic  with  a  large  number  of  cells,  the 
Mj's  can  be  used  to  develop  the  theory  behind  the  practical 
use  of  the  segment  counts  in  variance  approximations,  bound- 
ary length  estimates,  and  orientation  tests.   Recall  from 
(2.2.6)  that  tt/2  times  the  number  of  intersections  per  unit 
length  that  the  transect  makes  with  the  cell  boundaries 
gives  an  unbiased  estimate  of  the  boundary  length  per  unit 
area  for  the  phase  j  features,  with  a  corresponding  result 
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for  surface  area  in  three  dimensions.   In  defining  I   to 

J 
be  the  number  of  intersection  points  the  transect  makes 

with  the  boundaries  of  phase  j  features,  I  =2M.  unless 


the  test  line  begins  or  ends  with  a  phase 


j  region.   In 


any  case,  2Mj  gives  a  reasonable  large  sample  approximation 


of  I    so  that  any  asymptotic  results  for  I.  come  about 
J  J 

looking  at  corresponding  results  for  M.. 

J 

Define 


by 


and 


Jn  =  the  phase  of  the  nth  cell  cut 
by  the  transect 


F  . (x)  =  P(X  <  x  | J   .  =  v,  J  =  j  ) 
vj         n     '  n-1    *      n   J 


The  vector  M'  =  (M1  M2  •••  Mk)  constitutes  a  vector  of 
functions  defined  on  a  Markov  Renewal  Process  by  definition 
of  Pyke  [17  J,  with  transition  matrix 

(pvJ)  -  (pj)  , 

since  cells  are  independently  shaded,  and  sojourn  times 
governed  by,  {F'vj.  (x)}.  The  joint  asymptotic  distribution 
of  7E-  w111  be  derived  through  the  use  of  limit  theorems 
developed  by  Pyke  and  Schaufele  [18].  To  equate  the 
terminology  of  Markov  Renewal  Processes  with  the  sampling 
results  from  the  lineal  transects,  a  few  additional  terms 
need  to  be  defined.   Let 


6  .  =  < 

vj 


1    if  v  =  j 
0    otherwise, 
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(J)  = 


(expected  number  of  phase  v  cells 
crossed  by  the  transect  between  two 
consecutive  segments  of  phase  j)  +6 


p  /p  .  , 
v   j 


vj 


k 

I   P 

j  =  l 


vj 


Vtt 


expected  return  distance  to  phase  j 
on  the  transect 

v  (j) 

v 


and 


M 

v  wr 


u/P.  , 


expected  number  of  cells  of  phase  r 

on  the  line  between  a  segment  of  phase 

w  until  the  next  segment  of  phase  v 


Pr 

—  (1  -  6   )  +  6 

P      vv    wr 


The  theorems  needed  to  derive  the  asymptotic  distri- 
bution of  M  are  all  stated  in  terms  of  the  moments  of  f 

ij 
so  the  next  step  is  to  determine  these  values.   Without 

loss  of  generality,  the  attention  will  be  focused  on  phase 

1,  since  the  procedures  used  to  look  at  the  other  phases 

will  be  identical.   For  any  other  phase  j,  all  subscripts 

for  phase  one  can  be  replaced  by  the  pertinent  phase  j, 

for  j  =2,3,...,k.   To  look  at  the  moments  of  f.-,,  let 

ll 


^•Jf-i  )  =   f  f-iP-  dF.  (X.) 
Jm   il     J   il  jm   jm   1 
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Pm   if  J  =  1,  m^l 

0     otherwise  , 


d^Cf,,)  =   f  f,2lP.  dF.  (X.) 
jm   il     J   il  jm  jm  i 


Pm   if  j  =  1,  ra^l 

0     otherwise  , 


C.1(fil>  ■  Ulm<fil> 


m=l 


0   ■   j  /  1 
1-PX    .1=1, 


and 


J    H    ^jm   il 


0      j  *  1 
1  -  P2   J  =  1 


Using  the  Strong  Law  of  Large  Numbers  given  by  Fyke  and 
Schaufele  for  functions  defined  on  a  Markov  Renewal 
Process , 
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imJuVrii) 


1     a .  s  .       j  =  1 

r  ►   i^j-       -  (3.5.D 


Since 


Vfil}    = 


0  if   j  ?  1 

1  "  P1  J  =  1 


and 


»,(1)  -  i , 


Ml    a.s.       Pid-Pi' 


as    L  +  °°.      Therefore,    as    L  +  °°, 


3-5.2 


M  .         p  .  ( 1  -  p  .  ) 

-L     ^^   J   y  J-  (3.5.3) 


for  j  =  ]  ,  2  ,  .  .  .  , k . 

Through  a  Central  Limit  Theorem  result  of  Pyke  and 
Schaufele  [18]  for  functions  defined  on  a  Markov  Renewal 
Process,  it  will  be  shown  that  each  M . ,  suitably  normed, 
will  have  an  asymptotic  normal  distribution,  and  that  the 
joint  asymptotic  distribution  of  the  vector  M,  suitably 
normed,  will  be  multivariate  normal  with  a  closed-form 
expression  for  the  variance-covariance  matrix. 
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*■  le    Distrlbut  i  on    of   K.  A' 


From  Theorem   7.1(b)    and   Lemma    '• .  2    of   Pyke   and   Schaufele 


Lib  j, 


1       Y     I    r 

•/l  ,y  -ij 


x.p.ci  -n 


J 


(3.6.1 


distribution  as  L ->  °°  to  a  standard  normal 


r a n d o m  v a r i a b le,  provided  t 


:e  of  each  term  in 


1)  is  finite.   Recalling  tha 


V  f 

L     '   -• 
i  —  i  - 


ar;i  ce:/'n' 


.1)  becomes 


P  .;  V  1  ~  P  •  J 


In. 

/f   J        r   1 

I  ~L   '".]  ,.-P  TT 


lewal    theory    results    give 


/:, 


Thus,  workinr;  wish  (3-  6.1)  ultimately  yields  the 
f  M .     1 
r« ii*ty  cr   /L"i--'-1--C  ! 

J    !  L   Mj  ' 

.  ■.•;i*:h.out  loss  of  generality,  focus  attention  on 
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hil  =  fil  "  XiCl 


Lemma  4.2  of  Pyke  and  Schaufele  [18]  yields  the  asymptotic 
variance  expression  for  (3.6.1)  as 


-1 


(2),v,   ^(D 


bi  "^i  ir;(hn 


(3.6.2) 


+  2l      I        I     \   ^1m(h   )r  (h,n)m. 


(1)" 


(J   ) 
1  rar 


Following  the  previous  definition  of  terms, 


Vhil}    =     jhil(X)pJmdP 


]m 


=    %.     (f .  i  )    -    C..MP 


Pm-yclPril      j  =  i,m/i 

-yC,p      otherwise, 


(3.6.3 


^2)(hin)  - 

jm    11 


f  [f ...  -X.A-,  ]2p.  dP. 
J   ii    i  1    jm   j  m 


E-Jm><rU)-?Cl^M,„(f1P+Ck"(2) 


jm    i.  i      i'  -j 
where  p v L  '  = a2  +  yL .   Therefore 


J'n         '2(2)  U.0.4; 

ClPmy  "  otherwise  , 
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J       ll  m    Jm      2l 

r 


-yC1  j  ^  i 

~P2  -yC1        j  =  1 


(3.6.5) 


and 


Cj2)(hil>    =     P^)(hil 


c.u<2> 


j   5*1 


(1  -  pn  )  +  C?M(2)        i=l 


(1  -  P2)   -  2C1P(1  -  p     )  +  U1M  '  j 


(3-6.6) 


Nov;, 


(Dr(2) 


?mj       CJ^(h1]L)    =    l-p1-2C1M(l-p1)+    ^C2y(2) 


JP1 


=   1-p     -2Cn(l-p    )  +  -i- (3.6.7) 


and 


2M       I     ^im(hil^r(hil)ini1)]i 


j         1   mr 


2     I       I    (pm-yClpm)(^Cl) 


—  (1  -  6    .  )  +  6 
P-,  rl  mr 


+  .L  II(-,ca)(-»ci»; 


jVl  m^l  r^l 


—  (1  -6    ,  )  +  6 

P-,  rl  mr 


which  reduces  to 
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-2yC 


i  -P-,  -yc 


d-P^" 


i  M"i  p. 


3.6.8) 


Substituting  into  (3.6.2), 


h   y 


-  p1  -  2C1u(l-p  )  + 


c2y(2)  2yCl 


1-Pl- 


yC1(l-p1)" 


=  cl[Pl2  +  (i-Pl)2]+c2 


rM(2j 


-  2y 


(3.6.9) 


Therefore,  the  asymptotic  variance 


V*(M  //L)  =  C  [p2  +  (1  -p.)2]  +  c? 

<J  J      J  J  J 


f„(2) 

' 

y 

-  ? 

y 

, 

,   (3.6.10) 


for  j  =  1,2, . . . ,k.   Since  u(?)  < °°,  V* (M . //L)  <  »,  and  there- 
fore, the  asymptotic  distribution  of  each  M.//L  is  normal, 

?         ?   J  9f  (2) 

with  mean  C.  and  variance  C .  [pf  +  (1  -  p  .  )  ]  +  C   - 2i 

J  J   J        J       j  I   u     ' 


3-7   Joint  Distribution  of 


VL 


With  the  univariate  asymptotic  normality  of  each 
M.//L  established,  the  next  step  is  to  prove  the  joint 
normality  of  M//T.   The  argument  used  to  achieve  the  final 
result  is  given  by  Smith  [23]. 

Recalling  h   =f  .  -X^.,  write 


h,  =  (h., 
— j      11 


ik 


for  i  = 2,3, . . .  .   Now  E(h   )  =  0  for  all  i  and  j ,  and  let 
cov(hls,hlt )  = agt  for  s  j*  t .   Note  that  the  a  t  terms  have 


not  yet  been  developed,  but  will  be  used  In  findinp  the 


asymptotic  covarlance  between  M  //L  and  M.//L.   The  term, 


N 


Tj  _  I    h^.//L   will  converge  in  distribution  to  a  normal 


J   ^2  U 
random  variable  with  mean  zero. 


(e1  e2 


For  any  real  vector 


V 


k  k     N   h.  . 

e'T  =  I    e  t  =  I    e   J  -U 

j=l  J  J    j  =  l  J  i=2  /L 


'77!  I   I 


/L  . 


j=l 


.h.  . 
J  ij 


1   1N 

=  7^  I  (e1h11  +  02h.2  +  •• 


/L 


i=2 


+  e,  h..  ) 

k  lk 


Letting 


^(9lhil  +  e2h12  + 


+  e ,  h . .  ) 

k  lk 


and  as  L  ->  °°. 


E(W. )  =  0  , 


V  ( w .  ) 


I    e  v(h  )+2  He  e.cov(h.  ,h..) 

J=l   J      1J     S^t  S  t       IS   It 


* 

'V  6 


Since  e'T  converges  in  distribution  as  L  ->  °°  to  a  normal 
random  variable  with  mean  zero,  one  can  see  by  the  use  of 
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characteristic  functions  that  T  will  have  a  multivariate 
normal  distribution  with  zero  mean  vector.   That  is, 


/L(M/L  -  C)  -^  N(0,V*) 


3. 8   Asymptotic  Variance-Covariance  Matrix  of  M//L 

The  covariance  terms  of  M//L  can  be  obtained  through 
Lemma  4.3  of  Pyke  and  Schaufele  [18],  which  yields  an 
expression  Bgt  similar  to  the  Bh  introduced  earlier,  except 
that  when  s/t,  the  expression  deals  with  the  covariances 
of  hls  and  h±t    functions  instead  of  the  individual  variances 
of  the  h .   terms.   Formally,  let 


st 


h(hi.'h«>VUlrL«J.<hi8'<r<»it> 


+  £.  (h.. )?  (h.  )jm!1)1M 
-jm   it  srv  is"  j   1  mr 


3-8.1) 


where 


V^s^it5  =  j^s^tPjm^jm 

=  -jXi[fisCt  +  fitCs]Pjmd 
+  |  XiCsCtPjmdPj 


jm   jm 


/2)CsCtPm    j^s,t  or  j=m=s 

J  =  m  =  t 

y(?)°sCtPm"ljCsPm    J=s,m*s  (3-8.2) 
M(2)CsCtPrn-uCtpm   j-t.m^t 


and 


C,(h   h   )  =  1$.    (h.  ,h,. 
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(       (2) 
,J    CsCt 


J  ¥  s ,  t 


=  J 


U   yCsCt  -yCs(l  -Pg)   j  =s   (3.8.3 


UV  'C  C.  -yC,  (1  -p.  )    j  =t 


st   M  t 


;o  that 


£  £ ,  ( h ,  ,  h , ,  )  m . 

L-     J   l  s '  1 1   j 


(2) 


J*1   ^  C,-J--yCR(l-pQ)-^-yC,(l-p,)-t 


S  t  p     '   :: 


s  Px   t   ^ty  P1 


=  ^TClj(2)csct  -VjCsps(1  -Ps}  -vC^P^d  -Pj].  (3-8.4 


tl't 


Now, 


I  I   I  U.  (h   k  (h.,  )  +  E.  (h..)c  (h.  )]m(; 
j  m^l  r/1   Jm   1S   :  xt  ^im   xt   r   1S    J 


(1) 


1  mr 


j  J1Jia"UC=P".  +  P"'SJS(1-6-,,(-,,Ct  +  (1-pt)6rt} 


pl 

Pr 
—   1  - 

pl 

-6    .  )  +  6 
rl           mr 

'n 


2n 


=  I     I       I  Cm  C_C  p  -yC  p  (1 -p  )6  ,  -yC.p  6.  (1-6   ) 
i  m^i  >wi     s  t  m    s  m     t   rt    t*m  is     ms 


j  m^l  r^l 


+  Pm(1-Pt>V1-^s)^t+^\Vn1-^^1-Ps)6] 


-^.Pm^f.d-^)  +P,Jl-Pj6-f6„  (l-6mJ] 


s  ra  jt      mt 


s   j  t  r  i 


mt 


P 


Pr 

—  (1  -  6    ,)    +    6 

P-,      rl'     mr 


In  breaking  up  the  expression  into  a  series  of  fourteen 
triple  sums,  and  summing  each  up  under  restricted  terms  as 
defined  by  the  6  functions,  the  whole  expression  simplifies 
to 


M     ?  2 

— cc  pr  +  c, p  ], 

p,       s^t    t*s  » 


(3-8.5) 


so  that  the  asymptotic  covariance  term, 


M    M 


[VL   '  /Lj 


lJllBst 


=  y(2)CsCt-p(c2+c^)+CsP;+ctp2.  (3.8.6) 


Therefore,  the  asymptotic  variance-covariance  matrix,  D  , 
of  M//L  is  given  by 


dL  =  cj[pj  +  (i-pj)2]+ci 


M(?) 


(3.8.7) 


dst    =   CsPt+CtPs+ljC2)csCt-^(Cs+Ct2) 


(3.8.8) 


for    s/t 


I\? 


3 • 9   Variance  Approximations 

In  using  L./L  to  estimate  P    recall  from  (3.4.4) 
that  the  asymptotic  variance  of  L./L  is 


pjd-Pj) 


1  + 


2 


Pj.(l-P.) 


1  + 


E(N*) 


Under  the  assumption  that  the  individual  cells  are  not 
distinguishable,  a  value  for  N   would  be  difficult  to 
obtain.   Since 


*L  iJU  pJ(1-p.l' 

L 


U 


from  (3-5.1),  both 


(i) 


J  =  l 


M. 
J 


1  - 


and 


(ii) 


k 

j  =  l  l 


k  (L. 


1-  I 
j  =  l 


will  be  consistent  estimators  for  E(N  ),  as  would  each 
individual  estimator, 
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L. 
JL 

L 

f         L." 

for  j=l,2,...,k.   A  comparison  of  the  variances  for  each 
of  these  estimators  could  be  obtained  using  the  results  of 
(3-8.7)  and  (3.8.8),  but  this  becomes  mathematically  com- 
plex.  It  would  appear  that 


k 

I     M 
J  =  l 


k 

1-  I 

j  =  l 


(L.) 
A 
L 


would  be  the  most  stable  of  the  available  estimators,  since 

it  utilizes  the  sampling  information  from  all  k  phases,  and 
if  Ct.  !;> 

should  stay  fairly  constant,  even  over  widely 


also  I 
j=l 


L 
JL 

L 


ranging  values  for  each  L./L. 

It  follows  that  a  reasonable  consistent  estimator  of 
* 
v . .  would  be 
JJ 


JJ 


L. 

h, 

J 

1 

J 

i  +  K 

L 

L 

2 

M 

k 

I  M 
s=l  ° 


k 

1-  I 
s  =  l 


(3-9.1) 


It  still  remains  to  find  an  estimate  for  a2/u2,  or  at  least 
to  get  working  approximations  .to  its  value. 
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A  lower  bound  on  the  variance  term  would  exist  for  a 

cell  division  resulting  in  a  system  of  cells  very  close  in 

2 
shape  and  size,  so  that  y   would  be  considerably  greater 

2  2 

than  o  ,  resulting  in  o /u   approaching  zero. 

One  possible  way  to  obtain  estimates  for  \i    and  a2  is 

to  use  the  information  available  from  the  different  segments 

of  different  phases  on  the  sampling  transect.   Let  N  .  =  the 

nj 

number  of  cell  intercepts  composing  the  n  segment  of  phase 
j.  Recall  from  before  that  N  .  has  a  geometric  distribution 
with 


E(N  ,)  = 


nj     .  -  p 


V(N  .  )  =  ^ 

I1J     (l.-P,)2 


Therefore,  writing  Z  .  ,  the  length  of  the  nth  segment  of 
phase  j ,  as 


a  .  +  N  . 
nj    nj 

z  .  =       y      x.  , 

i  =  a  .  + 1 
nj 


where  a    =  the  number  of  cells  preceding  the  n    segment 


of  phase  j , 


V,    =    E  (  Z  .  )  =  E  ( N  .  )  \i    =  T-i 

J       nj        nj      1  -  p 


and 


°j    =   V(Znj>    -   V«V>    +   E(Nn.aM 


115 


Zi" 


2 


U-PjV 


2         1-F 

Rewriting   the    first    equation   as 


(3.9.2) 


Cl-Pj)E(ZnJ), 


u  can  be  estimated  from  each  color  by  using  the  average 

L. 
length  of  a  phase  j  segment,  Y    multiplied  by  (1--J-). 

J  J-j 

Thus  a  weighted  average  of  estimates  of  u  over  all  colors 

k 
would  yield  a  consistent  estimate  for  y.   Since  I    up.  =  u, 

one  possible  such  estimate  would  be 


k 

I 


fT-    , 

J 

I  L  J 

-^-    1 


Y. 
J 


(3-9.3) 


Rewriting  (3-9.2)  as 


2 


P  •  u 
a2  =  V(Z  .  )(1  -  p.)  -T—J 


would  yield  an  estimate  of  a2  by  estimating  p.  with  L./L 

and  u  by  (3-9-3),  and  using  the  sample  variance  of  the 

length  of  the  phase  j  segments  as  an  estimate  for  V(Z  .). 

nj 


3-10   The  Results  of  Milliard  and  Cahn 

The  method  just  suggested  in  the  previous  section  is 
used  by  Hilliard  and  Cahn  [8],  who  are  responsible  for  many 
of  the  applied  results  in  volume-fractions  analysis.   While 
their  results  are  only  derived  for  k=2  phases,  a  similar 
argument  could  extend  their  results  for  k>2  phases.   To 
compare  the  earlier  results  with  Hilliard  and  Cahn's  results 
for  k  =  2,  let  p}  =  p,  p2  =  q  =  1  -  p,  so  that 


1    1  -  p  ' 


PP 


and 


1     x-p  '  (i-p)2 


a%  .  si   +  (I-P)li' 

2    P      r2 


Therefore,  by  breaking  down  the  asymptotic  variance  for 
k  =  2  into  an  expression  using  the  means  and  variances  of 
the  segments, 


p(l-p)  1  +^_ 

v*.  =  J H_i 

J  J  L 


P2(l-P)2il+^ 


17 


P2(l- 

pr 

L 

u1  + 

y2 

P?(l- 

p)2 

L 

y1  + 

M2 

P2(l- 

P)2 

eo^ 

) 

2_LH 

2 


2  1 


(1  -p)o2  +pp2    pa2  +  (1  -  p)yj2 
2       +        2 


i"1 

2 


'2 


(3.10.1) 


which  is  the  Hilliard-Cahn  result.   The  sample  estimates 
gathered  from  the  line  segments  would  serve  to  estimate 
U1,  P2,  a2,  and  a|,  while  L^/L  and  M   estimate  p  and  E(M,  ) 
respectively.   This  approach  for  estimating  the  variance 
would  probably  yield  accurate  results  for  a  random  model 
with  many  shaded  features,  thus  'a] lowing  for  many  segments 
of  both  colors  on  the  line  segment  for  estimating  the  seg- 
ment parameters. 

It  should  also  be  noted  that  the  Hilliard-Cahn  result 
can  be  obtained  by  looking  at  the  segments  on  the  line 
transect  as  an  alternating  renewal  process.   By  defining 

Yi  =  length  of  the  ith  segment  of  phase  1 

2  th 

Y1  =  length  of  the  i  '   segment  of  phase  2 

the  Y's  are  i.i.d.  with  mean  y.  ,  and  variance  a2,  for 


j  =1,2.   With 


'4  8 


Ml  1 

r  I     ^ 

L  L 


a   variance   approximation    given   by    Smith    [23]    yields 

,2 


V 


L 


1 


L  ( JJ 1  +  y 


°1  "  2°1 


1 +  (a;  +  a; 

M1  +  y2  l 


M- 


yx+y2 


J 


and 


L<-Pl  +  y2^' 


)2l 


1  - 


P  2  +  pi2        I  ^  1  +  v  2 


+  o; 


V-j_  +  y  2- 


L(y1>  p2  ) 


Vl+v2 


+   a. 


2  I U  ^  +  y  2  J 


2] 


2    2 
1-1-^2 


L  ( y  1  +  y  2  )         y 


°1    ,   °1 

3  5 


'2     . 


With 


y-,    +  Mn     ' 
1  c 


1   -P     = 


y-L  +  y2 


E(Mn  )    =    x 

i         y ,  +  y  2 
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the  above  reduces  to 


2M    2, 

p  ( 1  -  p  ) 


'i 


'2 


which  is  (3.10.1)  . 

For  situations  in  which  the  Hilliard-Cahn  procedure 
may  yield  innaccurate  variance  estimates  due  to  a  small 
number  of  segments  on  the  transect,  it  may  be  helpful  to 
study  a  possible  model  for  the  cell  structure  in  order  to 
get  estimates  for  u  and  a2.   The  next  two  sections  present 
possible  models  for  consideration  in  such  a  situation. 


3-11   Foisson  Lines  Model 

The  mosaic  that  arises  from  a  Poisson  Process,  known 
as  the  Poisson  Lines  Model,  allows  for  the  random  division 
of  space  by  a  set  of  random  lines,  as  detailed  in  Section 
2.k.  Letting  t  be  the  density  of  the  random  lines  on  the 
process,  a  direct  result  of  Miles  [13]  is  that  the  inter- 
section points  of  the  line  transect  with  the  random  lines 

2  T 

forms  a  Poisson  Process  of  density  A  =  — .   Therefore,  the 

distribution  of  the  X  '  s  will  be  exponential  with  parameter 

A.   With  the  exponential  distribution,  u  =  a2 =  ~,    so  that 

A^ 


.1,1 


2p.(l  -p.) 

J J 

E(N  ) 


(3-11.1) 
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where  p..  can  be  estimated  by  L./L  and  E(N  )  approximated, 
as  mentioned  before,  by 


k 

s=l     S 

k 

r, 

2 

1  - 

"J, 

which  generalizes  Scheaffer's  [20,21]  results  for  k=2. 
Numerical  results  will  be  presented  in  Chapter  7,  which 
will  allow  for  comparisons  between  this  approximation  and 
the  Hilliard-Cahn  technique  from  the  previous  section. 

3-12   Meijering  Cell  Model 

The  Meijering  Cell  Model,  as  developed  in  Section  2.5, 
also  results  in  a  set  of  convex  polygons  with  identical 
size  distributions  on  the  mosaic.   The  means  and  variances 
of  the  cell  sizes  are  tabulated  by  Gilbert  [6]  for  two  and 
three-dimensional  mosaics  sampled  by  areal  sections  and 
lineal  transects.   For  a  two-dimensional  model  generated 
by  a  point  process  of  intensity  t,  sampled  by  a  lineal 
transect,  the  intercept  lengths  of  the  line  with  the  indi- 
vidual  cells  will  have  a  mean  u=1.027x  2  and  variance 
o2  =  .188t~  .   Therefore,  for  this  model, 


,).l 


1.18Pj(1  -Pj) 
E(N*) 


where  E(N  )  can  be  approximated  as  before 
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3-13   Tests  on  the  Mosaic  Using  Segment  Counts 

Consider  the  placing  of  multiple  transects  inside  the 
boundaries  of  the  mosaic.   The  measurements  of  interest 
include  the  individual  segment  counts  for  each  phase  from 
each  transect.   The  asymptotic  properties  of  the  segments 
were  developed  in  Section  3.8.   This  section  will  utilize 
the  results  from  Section  3.8  to  develop  a  test  for  the 
equality  of  mean  segment  counts,  over  all  transects. 
Applications  for  a  test  of  this  kind  would  arise  for  feature 
orientation  and  phase  homogeneity.   The  test  for  orientation 
would  consist  of  independently  placing  n  transects  of  length 
L  at  differing  angles  across  the  mosaic.   If,  for  individual 
colors,  the  number  of  segments  per  unit  length  varies  signi- 
ficantly from  transect  to  transect,  then  a  possible  color 
orientation  might  exist. 

A  segregation  test  would  involve  the  same  ideas,  except 
that  the  n  transects  would  be  placed  in  a  parallel  grid 
across  the  mosaic. 

Let 


J  ,m 


the  number  of  segments  of   phase  j 
on  the  m-th  transect 


and  let 


M'  =  (W.    M0    •••  M,    ) 
-ni      1 ,  m   2  ,  m      k ,  m  ' 


The  general  test,  which  follows  Scheaffer's  [19]  test  for 
k  =  2,  is  set  up  as  follows: 


52 


Hq:   E(M1)=E(M2)=     =  E(Mn) 

H  :   There  is  at  least  one  j'  such  that 
E(M .,  )  *E(M.)  for  j  jt  j'. 

J  J 

Recall  from  Section  3-8,  that  if  H   is  true,  -i(M  _lc)  -S- 

o        '  /L  -m  — 

Nk(0,D  )  as  L-x»,  so  that  under  H  .  ^(Mm~LC ) ' (D* )_1 (M  -LC) 
has  asymptotically  a  central  x?~  distribution  with  k  degrees 
of  freedom.   Let 


1   n 

h  =  -   y   h 

—      n  L.  — r 


n   '-,  -m 
m=l 


be  the  vector  of  averages  for  the  number  of  segments  for 

each  phase  over  the  n  transects.   Then  -4-(M-LC)  also  has 

v  h    —      — 

an  asymptotic  multivariate  normal  distribution  with  zero 

1     & 

mean  vector  and  variance-covariance  matrix  -  D  .   Thus  bv 

n  J 

writing 


M  -LC  =  (M  -  M)  +  (M-LC)  , 

— m   —     m   —    -     ' 


it  can  be  shown,  using  Cochran's  Theorem,  that  under  H  , 

o 

1   n  *  -i 

T  =  f   I  (M  -M)'(r>  )  J-(M  _m) 
L  m^1  -m   -         -m   - 

has  an  asymptotic  x2  distribution  as  L  -►  °°  with  (n-l)k 
degrees  of  freedom.   Therefore,  the  test  statistic  will  be 


T  =  4-  I    (M  -M)'(D*)  X(M  -M), 
L    t  — m   —         — m   —  ' 
m  =  l 
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where  D   must  be  a  consistent  estimator  for  D  .   Recall 
that 


and 


d  .  . 
J  j 


C.[Pj  +  (l-p.)^  +  cj 


(2) 


-2y 


st 


CsPt  +  CtPs^(?)CsCt-u(C^C^) 


for  s^t,  so  that  p  could  be  estimated  by  the  total  frac- 
tion of  phase  j  over  all  n  transects,  and  \x  and  y  ^ 2 '  esti- 
mated from  the  segment  information,  as  developed  in  Section 

3-9.   Since  M./L  is  consistent  for  C,  the  total  number  of 
J  3 

phase  j  segments  per  unit  length  should  serve  as  an  esti- 

* 

mate  for  C  .   With  D   approximated  using  the  terms  from 

above,  the  test  for  equal  means  would  have  a  rejection 
region  of  the  form 


T  >  y  2, 

X(n-l)k,a 


Recall  from  Section  3.9  and  3-10  the  need  for  the  assump- 
tion of  random  cell  assignment  in  order  to  use  the  variance 
approximation  from  the  Poisson  Lines  Model.   An  orientation 
test  could  be  used  as  a  prelude  before  such  an  approximation 
is  used. 
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3 • 1 ^   Spatial  Covariance  Function  Approach  to  Using  Lineal 
Sampling:   Poisson  Process 

The  approach  discussed  in  Section  2 . 3  to  obtain  vari- 
ance expressions  for  the  areal  ( volume ) -fraction  estimators 
depends  on  obtaining  an  estimate  for  the  covariance  function 
for  points  on  the  mosaic.   When  estimates  of  the  covariance 
function  are  available,  then  the  corresponding  variance  for 
lineal  fractions  is 


f  L .  1 
L 


cov(h)b(h)dh  , 


0 


where  b(h)  represents  the  density  function  for  the  distance 
separating  2  points  randomly  located  on  the  line  transect. 
When  the  sampling  scheme,  S,    is  a  line  transect  of  length  L. 
let  Y1  and  Y 2    represent  the  distance  of  the  two  points  from 
the  beginning  of  the  line.   The  distances  Y,  and  Yp  are 
independently  distributed  as  uniform  random  variables  on 
(0,L),  so  that  for  0  < h  < L, 


P(|Y1  -Y2|  <h)  =  P(Yf2)  -Ym  <h) 


(2) 

L  rh 


0  Y2-h 


2!-Ldh  =  i-Ikzhl 
L  \f 


Therefore 


b(h 


r(l  -  r)    for  0  <  h  <  L 


otherwise 


(3.14.1 
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Now  for  any  covariance  function  representing  the  under- 
lying mosaic,  the  variance  of  L./L  can  be  obtained. 

The  first  model  to  be  considered  is  the  Poisson  Lines 
Model,  already  having  been  defined  in  Section  2.4,  for 
which  the  covariance  function  for  color  j  is 


cov.(h) 


=  Pj(l  -Pj)e 


-Ah 


where  A  is  the  underlying  parameter  of  the  process.   There- 
fore, 


?Pj(i-pj) 


L 


i) 


-Ah   he 


-Ah" 


L 


dh  , 


which  as  L  gets  large,  will  result  in  a  comparable  expres- 
sion to  (3.11.1) . 

3.15   Boolean  Scheme 

The  model  for  the  Boolean  Scheme,  as  developed  in 
detail  in  Section  2.6,  yields  a  covariance  function 

,.»    2  xK(h)    2 

cov(h)  =  q  e      -  q 

for  a  model  generated  from  a  point  process  of  intensity  t. 
When  S  is  a  lineal  transect  of  length  L, 


,-         TK(h)    2.2,,   Ik  n 
[Q2e      -  q  ]  =-  (1  -  -)dh 


One  possible  approach  to  integrating  the  above  expression 
is  to  use  a  Taylor  expansion,  with 
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eTK(h)  -  eTK^+hTK'CO)eTK(0>+o(h) 


-  -[1  +  hxK'(0)]  +  o(h)  , 

where  K(h)  is  defined  as  in  Section  2.6.   Recall  from 
Chapter  2  that  -ttC'(O)  equals  the  expected  boundary  length 
per  unit  area,  which  in  turn  can  be  estimated  by  ~  x  the 
number  of  intersections  of  the  transect  with  the  cell 


boundaries.   With  -irC  '  (  0  )  =  -Trq  xK'  (0  )e 

estimate  for  tK'(O)  will  be 


tK(0) 


=  -TrqxK'  (0),  an 


2L 


-M 
2qL  " 


With  this  approximation  available,  e 
mated  by 


xK(h) 


can  be  approxi- 


1  - 


!qL 


so  that  V 


\LA) 


can  be  approximated  by 


(3-15.1) 


f  fn       hM    2,2,.   h^Jt 
J  (q~2L-q  )L(1-L)dh 


=  jLpq!(i-|)-^f(i-|)dh 


M  ,  M        M 
-  PQ  -  ?  +  3  =  Pq  -  5 


(3.15.2) 


However,  a  problem  with  this  approximation  might  arise  if 
K(h)  = 0  when  h  is  greater  than  some  fixed  distance. 
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A  second  possibility  would  involve  studying  specific 
models  which  could  generate  a  mosaic  from  the  definition 
of  the  Boolean  scheme.   Unfortunately,  many  modeling 
assumptions  result  in  an  expression  for  K(h)  that  makes 


V 


mathematically  complex.   One  of  the  simpler  models 


to  work  with  assumes  each  6.  set  to  be  a  ball  of  radius  r. 

2 

With  K(0)  =  -rrr  ,  It  can  be  shown  using  geometric  results 

mentioned  by  Matern  [11],  that 


K(h) 


K(0)  -  2r2sin  1£-  -  rh 
2r 


- 

,2 

L 

Hr2 

for  0  <  h  <  2i 


otherwise 


Therefore, 


/i -J. 


[qV^-q^U-^dh 


o  2  .  -1  h 
-2r  sin   —  -  rh 
cr 


n 

h2 

1 

\ 

~^7 

qe 


|(1  -£)dh, 


which  is  mathematically  difficult  to  integrate.   An  approxL 
mation  can  be  obtained  through  a  Taylor  expansion  of  the 
term 


n  2   .   -1  h 

-2r  sin   ^—  -  rh 
2r 


1  - 


However,  using  the  approximation  from  (3. 15.1),  V 
can  be  approximated  by 


[2P   2/i   ^   hM,,   hNjl 


2   „  2. 


_  pq(^r)  _  pq8r  "  _  2r'~M  +  8Mr 


3L- 


As  L  gets  large, ^the  second  and  fourth  terms  should  get  very 

1j  , 


small,  so  that  V 


can  be  approximated  by 


pq(^r )  _  2r  M 
L 


L 


(3.15.1) 


Note  that  M  is  a  function  of  L,  so  that  the  second  term  in 
the  expression  is  really  the  same  order  of  L  as  in  the  first 
term,  so  that  as  L  gets  large,  the  two  terms  should  get 
close.   However,  it  should  be  realized  that  the  whole  approx- 
imation is  based  on  the  expansion  of  e   ^   ,  from  which 
estimates  are  only  obtained  for  the  first  two  terms.   If  any 
additional  terms  of  the  expansion  are  significantly  differ- 
ent from  zero,  the  corresponding  variance  estimate  may  not 
yield  an  accurate  approximation. 


CHAPTER  H 
AREAL  SAMPLING 


^  .  1   Introduction 


In  considering  the  parameters  of  the  n-dimensional 
mosaic  as  defined  in  Section  3-1,  it  was  mentioned  that 
several  different  sampling  procedures  would  be  considered 
for  obtaining  information  vital  to  the  estimation  of  various 
properties  of  the  mosaic.   Chapter  3  developed  the  ideas 
behind  the  lineal  sampling  approach,  with  the  asymptotic 
properties  of  the  lineal  estimators  being  developed  from  a 
renewal  theory  argument  and  the  use  of  some  lin.it  theorems 
on  Markov  Renewal  Processes.   This  chapter  will  develop 
estimators  for  the  mosaic  parameters  when  the  sampling  pro- 
cedure Involves  the  random  location  of  a  planar  section  or 
sections  of  fixed  total  area  A  entirely  within  the  borders 
of  the  mosaic.   The  fraction  of  total  area  of  the  planar 
features  of  phase  j  gives  an  unbiased  and  consistent  esti- 
mate of  the  fraction  of  the  jth  phase  on  the  entire  mosaic. 
As  done  in  Chapter  3,  the  next  step  will  be  the  development 
of  the  asymptotic  joint  distribution  for  the  vector  of  areal- 
fraction  estimators,  followed  by  estimation  of  the  variance 
components  through  the  utilization  of  other  data  obtainable 
from  the  areal  sample. 
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^ .2   Asymptotic  Distribution  of  A//A 

Many  of  the  arguments  used  in  Chapter  3  can  be  extended 
and/or  paralleled  to  derive  the  asymptotic  distribution  of 
the  vector  of  areal-f raction  estimators,  assuming  that  the 
sample  is  a  planar  section  of  area  A  randomly  selected  from 
the  mosaic.   Under  the  assumption  of  the  underlying  cell 
model  for  the  construction  of  the  mosaic,  as  developed  in 
Section  3.2,  let 


,th 


X.  =  area  on  the  i  '   cell  on  the  plane, 


E(X. )  =  m  < 


V(X.  )  -  o2  <  oo  } 


1J 


1    if  the  i    cell  is  phase  j 
0    otherwise  , 


N   =  the  number  of  cells  on  the  sample  planar 
section  . 


Again,  assuming  that  the  individual  cells  will  not  generally 
be  distinguishable,  N   will  not  be  available  from  the  data. 
Letting 


A. 
J 


N 

J     <$>  ■  ■  X  •  , 
i-1  1J  2 


Aj  will  represent  the  total  sampled  area  due  to  phase  j 

features  and  A . /A  will  be  a  consistent  estimator  for  p 
J  j 

for  j  =1,2,..., k.   Let  A'  =  (Ax  A2  •••  Ak).   The  approach 

used  to  develop  the  asymptotic  distribution  of  -jt  A   is  to 

/A  — 
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think  of  the  cells  being  dropped  sequentially  and  filling 
up  a  total  area  A,  much  as  the  line  segments  fill  up  the 
lineal  transect  of  length  L.   Assuming  that  the  individual 
cells  will  be  small  in  comparison  to  the  large  sampling 
area,  so  that  of  the  sample  will  consist  a  large  number  of 
cells,  any  border  effect  of  the  areal  sample  will  be  negli- 
gible.  With  the  independent  assignment  of  colors  to  the 
cells,  the  sequential  arrangement  of  cells  will  allow  for 
parallel  arguments  to  the  renewal  theory  derivations  of 
Chapter  3-   In  addition  to  A  /A  being  a  consistent  estimator 
for  pj ,  it  also  is  asymptotically  unbiased,  this  being  a 
direct  result  of  Wald's  Theorem  in  an  analagous  proof  to  the 
lineal  case  of  Section  3.3. 

Furthermore,  the  joint  asymptotic  distribution  of 
■^j(A-p)  will  be  normal  with  zero  mean  vector  and  variance- 
covariance  matrix  VA ,  so  that  the  variance  of  A . /A  can  be 
approximated  for  large  A  by 


P,(l-Pl) 


Vjj  = 


1  + 


A/u 


(4.2.1) 


and   covariance    term, 


-P„Pf    1  + 
**  st 

rst    =    "      -     A/{T 


(4.2.2) 


for  s^t.   Hence  the  next  step  is  to  get  practical  estimates 
for  p  and  o2  from  the  information  available  from  the  sample 
data  . 


4 . 3   Estimation  of  the  Variance  Components 

To  obtain  a  practical  estimate  of  the  terms  of  V**, 
Aj/A  and  l-A./A  serve  as  consistent  estimators  for  p.  and 
1-Pj  respectively,  but  the  other  terms  are  parameters  of 
the  underlying  scheme. 

Some  models  may  arise  in  which  the  cells  are  very  sim- 
ilar in  both  size  and  shape.   In  this  case,  E(X?)/u2  is 

very  close  to  one,  so  a  lower  bound  on  v..  is 

JJ 


A. 
1-4 


A/u 

* 
Since  E(N  )/A  ->■  1/y  as  A  ->  °°,  A/u  can  be  thought  of  as  being 

approximately  equal  to  E(N  ),  and  an  estimate  for  E(N*)  can 

be  obtained,  similar  to  the  one  from  Section  3.9.   Defining 


M,  -  the  number  of  phase  ,1  features 
on  the  sample, 


I     M.  / 
,1=1  J 


1-  I 


is  a  consistent  estimator  of  E(M  ). 

However,  the  feature  count  in  two  dimensions  is  not  as 
easily  obtained  as  the  intercept  count  in  one  dimension 
since  very  irregularly  shaped  phases   make   counting  diffi- 
cult.  Frequently,  the  planar  sample  section  may  have  only 
one  continuous  feature  for  a  certain  phase,  such  as  in 
viewing  a  large  body  of  water  in  a  land  study  situation. 
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Perhaps  a  more  useful  result  could  be  obtained  from  a 
basic  stereological  result  introduced  in  Chapter  2.   Recall 
that  E[boundary  length  for  phase  J  features  ]/A  =  -ttC.  (0), 
where  C, (h)  is  as  defined  in  Section  2.3,  for  phase  j.   For 
this  estimate  to  be  used,  an  underlying  process  for  cell 
modeling  must  be  assumed,  so  that  a  form  for  C.(h)  can  be 
obtained.   The  next  section  discusses  this  estimate  for  a 
frequently  used  model,  the  Poisson  Lines  Model. 

4  •  4   Poisson  Process 

If  the  underlying  process  constitutes  the  realization 
of  a  Poisson  Process  of  density  A  =t/tt,  as  stated  by  Miles 
[131,  several  additional  results  of  Miles  will  help  to  sim- 
plify the  asymptotic  variance  expressions. 

The  individual  cell  areas  have  mean 


*  2 


and  variance 


2   o 

TT   -  2 

2~,    *~W 


and  the  expected  cell  boundary  length  is  2/A 
oVu2  =  (tt2  -  2)/2, 


With 


17  hllshl  -  ,rpj(1-pj) 


jj 


2E(N    ) 


2A(A     )r 


4.4.1) 


Recall  from  Section  3 . 1  i|  that  for  a  Poisson  Lines  Model 
generated  from  a  point  process  of  intensity  x,  the  covar- 
iance  function  of  color  j  is 


h 


cov.(h)  =  p.(l-p,)e  v       =   p.(l-P.)e 


* 

-2A  h 


y~   >j 


j        j 


so  that  the  expected  boundary  length  per  unit  area  of  phase 
j  features  is 


Therefore , 


E(B  ) 

A 


2Ap.(l-Pj) 


( J» .  H  .  2 ) 


B. 

J 

2A^p.  (1  -  p. ) 

u  J 


(4.it. 3) 


gives  an  unbiased  estimate  of  A 
information  over  all  colors, 


Utilizing  the  sampling 


k 


B 


1 


j  =  l 


2Ak7T      k 
j  =  l 


gives  a  consistent  estimate  for  A  .   In  arguments  similar 
to  those  of  Section  3.9,  it  would  seem  that  this  would  give 
a  reasonable  estimate  of  A,  since  the  denominator  expression 


k 
of  I 

j  =  l 
areal-fraction  estimates . 


would  appear  to  be  fairly  stable  over  varying 
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]4 .  5   Meljering  Cell  Model 

For  the  Meljering  Cell  Model  defined  in  Section  2.5, 
Gilbert  [6]  tabled  the  following  mean  and  variance  for  the 
areas  of  cells  arising  from  a  point  process  of  intensity  t 

_  2 

U  =    1.018t  5 
and 


228t 


Therefore,  for  this  model 


**        p.(l-p  ) 

v.  .  =1.  22  -J «L 

J  J 


E(N  ) 


It  should  be  noted  that  for  both  the  lineal  and  areal  cases, 
this  model  yields  less  variability  for  the  areal-f raction 
estimators  than  the  Poisson  Lines  Model,  so  that  it  might  be 
preferred  for  mosaics  with  similar  type  features. 

^•6   Comparisons  with  the  Results  of  Hilliard  and  Cahn 


A  variance  expression  for  A  . /A  is  obtained  by  Hilliard 
and  Cahn  under  the  assumption  of  two  rigid  conditions.   The 
first  assumption  is  that  the  number  of  equal  sized  features 
of  color  j  within  the  given  region  follows  a  Foisson  distri- 
bution.  Second,  the  areal  fraction  for  any  phase  is  assumed 
to  be  very  small.   Their  final  result  is 


(A. 


F(M;  ) 


■taj+"j] 


(1.5.1) 


where  \i.    and  a2,    denote  respectively  the  mean  and  variance  of 
a  phase  j  feature.   Recalling  that  a  feature  may  be  composed 
of  one  or  more  individual  cells,  and  using  the  arguments  of 
Section  3.9,  it  can  be  shown  that 


and 


J         1  -  P 


o.    =    — 


+      >J 


J  (1    ~    P,    ) 


or    equivalently, 


and 


"  =  V)a-P3) 


-    tl-PjlLj-Mj], 


with 


v .  . 


A/u 


1  +  -? 

\1 


1  +  ^- 
2 
u 


E(M .)       2 

f-[p     +o2] 

A" 


E(M .)  2 

^L[a?+u:  +  {(-3Pi+2ppu"-p.ap] 


(1.6.2) 
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which  reduces  to  (Jl.6.1)  when  p.  is  very  small.   As  a  result, 
the  variance  expression  obtained  by  Milliard  and  Cahn  repre- 
sents a  special  case  of  an  asymptotic  variance  from  a  more 
generalized  model.   The  result  of  ('1.6.1)  may  be  a  useful 
expression  if  there  are  a  few  individual  features  from  which 
we  can  obtain  areal  measurements  within  the  sampling  plane. 
The  sample  mean  and  variance  can  then  be  used  as  consistent 
estimators  for  u .  and  a?  respectively. 

J  J 

^ • 7   Spatial  Covariance  Function  Approach 

Recall  from  Section  2.2  that  if  S  represents  the 
sampling  scheme  on  the  mosaic,  then  the  variance  of  the 
areal-fraction  estimate  is 


cov(h)bc(h)  , 


where  bg(h)  is  the  density  function  for  h,  the  distance 
between  2  points  randomly  located  on  S.   Even  for  a  recog- 
nizable covariance  function,  the  variance  term  may  be 
extremely  complicated  mathematically.   The  one  modelling 
scheme  that  has  been  studied  in  the  results  of  Chapter  3 
and  4  has  been  the  Poisson  Lines  Model,  in  which  for  color 


cov.(h)  =  p  (l  -  p  )e  c 
J       J      J 


A  h 


In  studying  the  areal-sampling  approach  to  volume  fraction 
estimation,  let  S  represent  a  convex  set  of  total  area  A  on 


the  plane.   A  geometrical  result  due  to  Borel  and  given  by 
Matern  [11]  is  that 


~,   2C  h2      _ 
bs(h)  =  ^-_J^_  +  0(h2) 


(4.7.1) 


as  h  -*  0,  where 


C   =  total  perimeter  of  boundary 
length  of  S. 


Note  that  this  approximation  is  only  applicable  when  h  is 
small,  but  as  h  gets  large,  cov . (h )  = p  .  ( 1  -  p  .  )e_2A  h  gets 

<J  J  J 

very  small.   Also,  as  A  gets  large,  the  second  term  of 


bc,(b)  becomes  very  small,  so  that  V 
by 


can  be  approximated 


2TTp   (1-p   )   p.       * 

■''   '  he   A  hdh 


P.1(l-p.1) 


*  ? 
2(A  )  A 


yielding  the  same  result  as  (HA.l). 


CHAPTER  5 
POINT  SAMPLING 


5-1   lntroducti< 


The  final  method  of  estimation  to  be  discussed  involves 
the  use  of  point  sampling  on  the  mosaic.   By  placing  points 
within  the  boundaries  of  the  mosaic,  the  fraction  of  points 
falling  upon  a  certain  phase  can  be  used  as  an  estimator  of 
the  phase  proportion.   Koop  [9]  has  discussed  21  methods  of 
point  sampling  for  the  two-phase  planar  mosaic.   Several  of 
the  simpler  techniques  will  be  discussed  and  extended  for 
the  general  k -phased  mosaic.   Also,  the  asymptotic  distri- 
bution for  the  phase  proportion  estimators  will  be  developed 
under  restricted  conditions. 


5.2   Random  Points  on  the  Plane 


Nor  a  planar  mosaic  of  area  A,  let  A^A^...^,  denote 
the  total  area  on  the  mosaic  for  phases  1,2,... ,k  respec- 


tively, such  that  I    A.  =A.   As  noted  earlier  1 


j  =  l 


in  Chapter  3, 


the  mosaic  itself  may  represent  a  planar  section  of  a  two 
or  three-dimensional  population  figure,  for  which  each  A . /A 
would  serve  as  an  estimator  for  the  population  proportion 
of  phase  j.   Nor  a  point  located  at  random  within  the 
boundaries  of  the  planar  mosaic,  the  probability  that  the 
point  falls  on  a  phase  j  feature  is  A. /A  =  P..   Random 
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sampling  involves  the  independent  location  of  a  fixed 
number,  n,  of  points  at  random.   Let  M.  equal  the  number 
of  the  n  points  belonging  to  phase  j  features  for  j=l,2,. 


..,k.   The  distribution  of  N'  =  (N   N 


N  )  is  multi- 


nomial with  probabilities  F  , . . . ,F  .   This  is,  of  course, 
a  standard  result  of  random  point  sampling.   Individually, 
N./n  is  an  unbiased  estimator  of  P.,  with 


fN  1 


Y^V 


(5-2.1) 


and 


fN    N,  )         -F  F 
s    t 


(5.2.2) 


for  s/t,   As  n  gets  large,  the  distribution  of  N/7n 
approaches  normality.   For  random  points  sampled  from  a 


lineal  transect  with  phase  proportions 


L, 


L, 


L  '  *  •  •  '  L 


respec- 


tively, the  results  will  be  similar,  except  that  L./L  will 

J 

be  used  instead  of  F.. 
J 

5  •  3   Systematic  Points  on  a  Line 

A  second  method  of  point  sampling  that  is  widely  used 
in  many  different  kinds  of  experiments  involves  systematic 
sampling  of  points  on  one  or  more  lines  passing  through  the 
mosaic.   This  method  has  some  practical  advantages  In  its 
use  over  the  random  sampling  technique.   It  generally  will 
offer  the  experimenter  a  more  ccnvpnient  design  or  layout 
of  points  on  the  mosaic,  and  estimators  obtained  from 
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systematic  point  counts  also  are  unbiased,  with  variances 
inversely  proportional  to  n. 

For  an  individual  line,  denote  the  lineal  fraction  of 
phase  j  by  L./L,  as  defined  in  Chapter  3.   It  should  be 
mentioned  that  two  possible  situations  may  exist  to  allow 
for  systematic  sampling.   The  first  will  occur  if  the  line 
by  itself  constitutes  a  one-dimensional  mosaic,  in  which  the 
vector  L  is  constant.   The  other  situation  arises  when  the 
transect  is  randomly  located  in  a  higher-dimensioned  mosaic, 
as  discussed  in  Chapter  3.   This  will  be  the  case  under 
consideration  in  this  section,  so  that  for  each  transect, 
the  values  for  {L  }     will  be  random,  with  L./L  repre- 
senting only  an  estimator  of  F . .   The  process  of  locating  a 
fixed  number,  n,  of  points  on  the  line  will  start  by  dividing 
the  line  into  n  intervals,  each  •  interval  of  width  A  =  L/n. 
The  first  point  is  then  placed  at  random  in  the  first  inter- 
val, with  each  succeeding  point  a  distance  A  from  its  near- 
est neighbors,  as  defined  by  Cox  [3]. 
Define 

1    If  the  I    point  on  the  line 


u.  . 


belongs  to  a  phase  j  segment 
0    otherwise. 


Due  to  the  random  location  of  the  line  and  the  first  point, 
E(u..)=F.  and  V(u.  . )  =  F. (1  -  F.  )  .   Letting 

n       n     ' 
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N.l 


(  _Fj'  so  that  N./n  would  appear  to  be  a  reasonable 


estimator  of  F .,  since  it  is  unbiased 
it  is  first  necessary  to  develop  the 


To  look  at  V 


Define 


COV<uij'ui+s,j>=  E<uij'»i+s,j>-  V 


Tr(h)  =  P(2  points  a  distance  h  apart  belong 
to  the  same  cell ) . 


Now, 


E(uU'ui+a,j>  "  pl»ir1'ui+:3ml) 


Therefore 


so  that, 


=  P(u   =  l)P(u, .       .  =  1  |  u. .  = 1) 

=  F.[tt(sA)  +  {1  -  tt(sA)}F.] 
=  F.(l  -  F.)tt(sA)  +p^  . 


cov(u  .  ,  u.    ,  )  =  F  (1-F .  )tt(sA)       (5-3.1) 
±<J         J-  u  >  J      J      J 


Var 


F.(l-F  )        n 

— —  +  -p  I     F,(l-F.)(n-m)7r(mA).(5.3.2) 

n   m=l  d     J 


To  make  use  of  the  usual  inferential  techniques,  an  esti- 
mate for  tt(itiA)  must  be  found.   The  approach  is  intuitively 
parallel  to  the  derivation  of  Section  3.9. 
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5.4   Estimation  of  the  Vari; 


Define 


4 


1    if  u-  .  =  1  and  u+.  •  =  0 
sj        utj 

0    otherwise. 


Then, 


n-s 


i=l 


i ,  i+s 


can  be  used  to  study  the  transition  out  of  phase  j  for 
points  sA  apart. 


E(MJ)  =  (n-s)E(^'     ) 

=  (n-S)P(usJ-l)P(utJ-0|usJ«l) 

=  (n  -  s)F  [1  -  tt(sA)](1  -F.) 

=  (n  -  s)F  (1  -F.)[l  -  tt(sA)] 


(5.4.1) 


so  that 


tt(sA)  =  1  - 


E  ( MJ'  ) 

s 

In  -  s)F  (1  -F.)  ' 
J      J 


Therefore,  V 


I N . ) 
n 


can  be  estimated  by 


(5.^.2) 


fN,] 

N.l 

— 

1 

J 

I"  J 

n 

N. 

n 


2 


£  (n  -  m) 
m=l 


1  - 


s 

M. 

N. 

(n 

-m) 

n 

1 

_  _J_ 
n 
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1  - 


n  n-1  . 

--2  i  A 

n     s  =  l 


(5.4.3) 


Note  that  as  n  gets  large,  A  becomes  smaller,  so  that  the 
points  get  closer  and  closer  on  the  line,  allowing  for  more 
transitions  in  and  out  of  a  particular  phase.   However,  in 
looking  at  (5-4.3),  each  variance  approximation  uses  an 
estimate  of  tt(sA)  in  terms  of  the  individual  phase  j  . 
Rather  than  use  different  estimates  in  each  individual 
variance  expression,  it  would  seem  far  more  reasonable  to 
utilize  all  of  the  information  in  some  form  of  weighted 
expression  that  could  be  used  for  all  variance  and  covar- 
iance  terms,  independent  of  the  particular  phase  under 
analysis.   Following  the  same  line  of  reasoning  employed  in 
the  lineal  case, 


(n  -  s)tt(sA)P  (1  -F  )  =  (n  -  s)F.(l  -  F.  )  -  E(Mj) 
J      J  J      J       s 


for  j  =  1,2, . . . ,k,  so  that 


(n  -  s  )tt(sA) 


1-  [F 
j  =  l  J 


=  (n  -  s 


-  I   e(m*) 


resulting  in 


tt(sA)  =  1  - 


(n  -  s) 


t = i 

k 
t  =  l 

^  n 

?~ 

(5.4.4) 
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as  a  stable  estimate  for  n(sA).   Averaging  over  all  values 
'  N ,  ' 


of  s  ,  V 


can  now  be  estimated  by 


(N.l 

r 

N,l 

■' 

i 

J_ 

',  «  ) 

n 

+  --, 

n' 


(V 

.  n  . 

V 

n-3 

J 
n 

I 

1  - 


t  =  l  S 


1-1 

t=l 


(5.4.5) 


5 . 5   Covariance  Between  Colors 

If  comparisons  are  to  be  made  between  colors,  the  covar- 
iance term  needs  to  be  established.   For  s^t, 


n   n 


1 

- 

2 

E 

n 

n 

1     u, 

1  =  1  2 


n 

£  uit 
1=1  1T: 


n 


n  1 

y  u.  u.,  +   y  u.    u  , 

,•_-,   is   it    .  fj   is   mt 

i-l  i^m 


The  first  term  of  the  above  expression  is  zero,  since  a 
point  cannot  be  in  two  different  colors.   Thus, 


N      N. 
s      t 


~?      I      P(uis  =  1   >  umt  =  1) 
n      i#n  1S  mt 


P     n-1 

-P       J      F    [1  -TT(mA)]F,  (n  -  m) 
n      m=l  L 


F    F. 
s    t 


n-1  n-1 

I    (n  -  m)  -     I    (n  -  m)Tr(mA) 
m=l  m=l 


^Vt-4-psptnt(n-m)7rfmA) 

n  m=l 


(5.5.1) 
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Therefore, 

'N    N 


~FsFt 

n 


n-1 


-TF,Ft  I    (n-m)ir(mA)     (5.5-2) 


2  J  s^t 
n      m=l 


with  estimation  of  TT(mA)  from  (  5  .  4 .  l\  ) 


5.6   Asymptotic  Distribution  of  the  Estimates 


It  will  be  shown  that  the  distribution  of  -r*  N  will  be 

/n  — 

asymptotically  normal  under  the  assumption  of  the  develop- 
ment of  the  mosaic  from  a  Poisson  Lines  Model.   Unlike  the 
other  areal  ( volume ) -fraction  estimators  that  have  been 
obtained  through  lineal,  areal,  and  random  point  measure- 
ments the  systematic  point  count  estimators  cannot  be  writ- 
ten as  an  average  of  independent  indicator  functions.   As 
discussed  in  Sections  5-3  and  5.4,  the  systematic  points  on 

the  line  are  dependent  with  covariance  function  cov.(h)  = 

J 

F.  (1  -  P,  )ir(h)  for  points  h  units  apart.   Therefore,  the 
asymptotic  normality  of  the  vector  of  estimates  must  be 
achieved  through  a  Central  Limit  Theorem  for  dependent  ran- 
dom variables. 

The  theorem  to  be  used  is  a  multivariate  result  of 
Hannan  [7],  stated  as  follows: 


Let  X15X2,...,Xn  be  k x 1  vectors,  defined  by 


Xll 


"12 


X.. 

lk 


Si-Fii 

u.  2  -  F2 


u  . ,  -  F. 
ik    k 
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such  that 

(i)   Each  vector  of  observations  satisfies  the  strong 
mixing  property, 
(ii)      u. 


rsw   1 


(m  ,m?)  =  E(X.   X   a  X    ) 
lr   m, s   rruw7 


for  m1  and  m?    integers  such  that  1  <  m  <  rru  <  n 
is  finite  when  summed  over  all  possible  values 
for  m,  and  m0. 
(iii)   The  covariance  spectrum,  [f(e)]  is  continuous  at 
e=0  and  trace  [f(e)]  is  uniformly  bounded, 
where 


frs(£)=~ 


e  lh£C   (h)dh  . 
rs 


(5.6.1) 


Then,    the    vector 


fNl 
1T-Fi 

N 

1T-P2 


—  -F, 
n  k 


will  be  asymptotically  normal  as  n  ■*■  °°  with  vari- 

ance-covariance  matrix  2ir[f(0)]. 
In  the  general  case,  the  first  condition  (i)  will  not  neces- 
sarily be  true  and  can  be  extremely  tedious  to  prove  from  a 
mathematical  approach.   However,  under  the  assumption  of  a 
Poisson  Lines  Model  used  for  the  development  of  the  cell 


7! 


structure  of  the  mosaic  (Section  2 ,H )  ,  the  number  of  inter- 
sections the  transect  makes  with  the  cell  boundaries  has  a 
Poisson  distribution.   With  this  additional  property,  the 
combination  of  two  theorems  of  Switzer  [24]  and  Billingsley 
[2]  yield  the  strong  mixing  property.   Switzer  proved  that 
for  the  ordered  set  of  colllnear  points  on  a  planar  Poisson 
Process,  f^.}.^  will  have  the  Markov  property  for  j  =1, 
2,...,k,  and  thus  the  set  of  X.  vectors  will  satisfy  the 
Markov  property.   The  general  strong  mixing  property  will 
be  satisfied  if  for  two  events,  G  defined  as  a  function  of 
the  information  from  the  first  i  points,  and  J3  defined  as  a 
function  of  the  information  from  the  points  after  the 
(i+m-1)    point,  then 

|P(GB)  -  P(G)P(B)  I  <  a(m)  I  0, 

as  m->°°.   Billingsley  [2]  showed  that  a  stationary  Markov 
Process  satisfies  the  strong  mixing  property,  so  that  con- 
dition (i)  has  been  satisfied  for  the  Poisson  Lines  Model. 
For  this  model,  the  covariance  function  is 

cov. (h)  =  P  (1  -  F.)e"Xh 

which  will  now  be  used  in  showing  that  conditions  (ii)  and 
(iii)  both  hold. 

To  show  the  existence  of  the  second  condition,  consider 

Irsw(ml'm2)  =  E(ulr  ^s  V^        ^.6.2) 
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and 

I   (m. ,m. )  -  E(u    u    ) .         (5.6.3) 
i     J 

Then 

W'V'V  =  Irsw(mrm2)-FrI5W(ml'm2) 

-FSIrw(1'm2)-FwIrs(1'ml)+2FrFsFW   (5'6-2> 

Note  that  r,s,w,  represent  different  phases  on  the  mosaic 
so  that  several  cases  must  be  studied. 

CASE  I:   Consider  r  =  s  =  w.   Without  loss  of  generality,  call 
this  phase  1.   Then 

I   (1  ,m  ,m  )  =  P(U-   =  1  ,  u  1  =  1  ,  u   .  =  1), 
11     12        11  m,  1       rripl     ' 

which  can  occur  in  any  one  of  three  ways: 

A:   All  three  points  belong  to  the  same  phase  1  cell. 

B:   The  first  two  points  belong  to  the  same  phase  1 
cell,  while  the  third  belongs  to  a  different 
phase  1  cell. 

C:   The  second  and  third  points  fall  in  the  same 
phase  1  cell,  but  the  first  point  is  on   a  dif- 
ferent phase  1  cell. 

D:   All  three  points  are  on  different  phase  1  cells. 
Therefore , 

I111(lJm1,m2)  =  P(A)  + P(B)  + P(C)  + P(D) 


where 


P(A)  =  Fie 


-A(m2-1)A 


P(B)  =  F^e     1 


1  -  e 


-A  (nip-m  )A 


P(C)  =  Fj 


1  -  e 


-A(m1-1)A 


-A (m„-m  )  A 


and 


P(D)  =  F 


1  -  e 


-X(m,-1)A 


1  -  e 


-A  ( nip-m,  )  A 


Therefore, 


-A(m  -1)A   „         -A(m0-1)A 
Iin1(miJm0)  =  e     J     (F^  -Fp  +e    2 


'iir  r  2 


2    2    3 


(F1  -F1  -F2  +Fp  + 

-A (m„-m^  )  A 


-A (m„-mn ) A 


2   1   (fJ-fJ)  +fJ,  (5.6.5) 


I-11(m1,m2)  =  F  i 


'2  '"1 


+  F 


1  -  e 


-A (m„-m, ) A 


1 


-A (m0-m, ) A    _ 
=  Px(l  -P1)e    ^   2   +FJ 


(5.6.6) 


When  these  expressions  are  substituted  into  the  expressit 
f°r  Ml]i'riVm2')' 


M111(m1,m2)  =  G1(F1)e 


-A  A  (nip-m,  ) 


where  G1(F1)  is  a  third  degree  polynomial  function  of  F 
To  prove  that  this  moment  is  finite  over  all  values  of  i 


and  m„ ,  observe  that 


=lm=l      l       l 


-AA(m2-l) 


2    "*  "1 
by    result    of   the    ratio    test 


-AA(mp-l) 
I      m1G(F1)e  <  » 

rhp=l 


CASE_II_:   Let  r  -  s  ?  w.   To  simplify  notation,  let  r  =  l, 


I112(m15m2)  ==  PCUn-l.^^-l.u^   2 


=  1 


1 


which  occurs  when 

A:   The  first  two  points  belong  to  the  same  cell  of 

phase  1,  and  the  third  in  a  phase  2  cell. 
B:   All  three  points  fall  in  different  cells,  the 
first  two  being  phase  1  cells  and  the  third 
being  a  phase  2  cell. 


P(A)  =  P  e 


-X(m,-1)A 


1  -  e 


■A  (m„-m  )  A 


2  ' 


P(B)  =  F 


1  -  e 


-A(m, -: 


1)A~]   f"    -A(rn9-m1  )A] 


(5.6.7) 


F2.  (5.6.8) 


Therefore,  \i    12  (m,  ,m? )  again  reduces  to  the  form  of 


G2(F1,P2)e 


-A(m2-1)A 


which  when  summed  over  all  values  for  iru  and  m„  is  finite 
CAGE  III  :   Let  s  =  w^r,  and  let  r  =  1  and  s  =  2.   Then 


I122(m15m2)  «  P^-l.u^^  -1,^^2-1), 


1 


which  occurs  when: 

A:   The  second  and  third  points  share  a  common  phase 
2  cell. 


B:   All  three  points  appear  in  different  cell: 


P(A)    =    F 


1  - 


1  -  e 


-A(m1-1)A 


F   e 
2 


-A  (rrip-m,  )  A 


(5.6.9) 


P(B)    -    F 


1  -  e 


-A(m1-1)A 


2 


1  -  e 


-A  Cnip-m,  )  A 


_jF2  .      (5.6.10) 


Therefore 


I22(m1,m?)    =    -F   F2e 


-A(m   -1)A  _A(m   -i)A 

-F1F2(1-F2)e  d 

-A (m.-m1 ) A 


+   F1F2(1  -F2)e 


2      1 


so    that 


M122(m1Jm2)    =   G    (Fl5F9)e 


-A(m2-1)A 


and 


I        I    y122(ra1 


m   =1    m   =1 


rrip  )  <  °°    . 


CASE    IV:       Let    r  /  s  /  w.       To    simplify    notation,    let    r  =  l, 
s  =  2 ,    w  =  3  , 


WW   =   ^ll81'^'1'^     3SD 


=    F    F    F 
12    3 


1  -  e 


•A(ni1-1)A 


1  -  e 


-A (iru-m, ) A 


(5.6.11 


Therefore , 


P123(m1,m2)    =    Gi|(P1,F2,P    )e 


-A(m2-1)A 


which  is  finite  when  summed  over  m,  and  m?,  and  the  condi- 
tion (ii)  has  been  satisfied. 


The  third  condition  needed  to  satisfy  Hannan ' s  theorem 
states  that  the  matrix  of  spectral  density  functions, 
[f(e)],  is  continuous  at  e  =  0  and  that  the  trace  [f(e)]  is 
uniformly  bounded  over  all  values  of  e.   As  previously 
defined  by  (5.6.1), 


f   (e)  =   r'     r-    e-lhEe-AAlh'dh 


"2tT 


F  ( 1  -  F  ) 

r      r      AA 


(AA)2  +  e2 


(5.6.12) 


For  r  ??  s  , 


C   (h)  =  F  (l-e-AAlhi)F  _F  F 
rs       r  s    r  s 


c  -c   -AA  h 

=  -F  F^e    ' 


so  that 


(AA)  +  e 


Noting  that  each  f   (e)  is  continuous  at  e  =  0,  and 


trCf(e)]  =  — - M ~  I    F  (1-F  ), 

[(AAT  +  e  ]  r=l  r 
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which  is  bounded  by  trCf(O)].   Therefore,  all  conditions 
have  been  satisfied,  and 


•n 


n 

"Fl 

n 

-Fk 

has  an  asymptotic  normal  distribuiton  with  zero  mean  vector 
and  asymptotic  variance-covariance  matrix,  V    ,  where 


(n) 


2F. (1  -  F. ) 

__J J_ 

AA 


and 


(n)  _     r  s 

rs       X  A 

( n ) 

for  r/s.   To  estimate  the  terms  of  V    ,  recall  from 

(5-^.2)  that 


(n  -  s)F. (1  -  F. ) 
J      J 


is  an  unbiased  estimator  of 


I-tt(sA)  =  1  -  e 


-AsA 


for  a  Poisson  Lines  Model.   Using  a  Taylor  expansion  on 


-AsA   .  -,  , 
e      yields 


k 
1=1  S 


(n  -  s) 


1  -  If 
j  =  l  ' 


as  an  estimator  of  AsA.   Thu: 


n  -  1 


n-1 


s=l 


3=1    k 


(n-s) 


i-   If! 

j  =  i  ■• 


sA 


(5.6.15) 


is  a  reasonable  estimator  of  A.   With  N./n  estimating;  F., 
Var  n   Can  now  be  estlmated.   Also,  recalling  from  the 
lineal  estimation  procedure  that 


(L. 
L 


AL 


for  a  lineal  Poisson  Process  of  intensity  A,  the  result  from 

systematic  point  counts  yields  a  similar  expression  with  L 

being  approximately  equal  to  nA,"  and  F.  estimating  p.. 

J  J 

5- 7   Extensions  of  Point  Sampling 

In  developing  the  theory  behind  point  sampling  on  the 
line,  it  was  noted  that  the  line  represents  a  one-dimen- 
sional cut  from  a  higher-dimensioned  mosaic.   Koop ' s  1976 
results  come  from  21  different  variations  of  point  sampling  in 
two   dimensions.   Two  methods  will  be  discussed,  since  their 
results  can  be  tied  in  with  the  results  already  obtained  in 
this  chapter. 

The  first  sampling  procedure  of  random  points  on  random 
lines  locates  b  lines  randomly  and  independently  inside  the 
mosaic,  such  that  a  points  are  placed  at  random  locations  on 
each  line,  accounting  for  n = ab  points  on  the  plane.   Define 


86 


lm 


1    if  the  i    point  on  the  m    line 
belongs  to  a  phase  j  feature 


0    otherwise, 


and  let 


and 


Then 


and 


Thus, 


pmi  "  the  Pr°Portion  of  the  line  occupied  by  the 


mj 


j-th  phase. 


P-  -  the  proportion  of  the  plane  occupied  by  the 
j-th  phase. 


P(uim  =  1  'llne  ^    "  Pmj 


E(u-J  )  =  E  E(u^   l.line  m)  =  p 


a    b   . 
n        n 


gives  an  unbiased  estimate  of  p . ,  with 

J 


a        b 


I    \     I    var(u.     )  +     T    cov(u'l     ,u^4 
i-1    to-1  im         m/t  im      l1 


+       I  ■{    I    covdl!     ,u']     )+     I    covdA     ,uJ'     ) 
jfi_    _ri  im'    sm  H.  im'    st 


i^s    m=l 


m^t 
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Due  to  the  random  and  independent  location  of  the  lines  on 
the  plane, 

cov(u"?  '  u^,  )  =  0 

lrn  '  st 

for  all  m^t,  so  that  only  the  first  and  third  terms  of  the 
variance  expression  need  to  be  considered.   Now 

v(u.Jm)  =  Pjd-Pj), 

and 

cov(u.  ,  uJ  ) 
ira   sm 

=  EmCOv(uim'Usm  '  line  m)  +  cov  E(u'|m'Usm  '  llne  m) 

m 

=  cov(pm.,Prn.)  =  Var(pm.)  =  a^   , 

17 
where 


a:   =  v., 

L;      J  J 

lL 

L 


as  previously  derived  in  Section  3.H. 
Thus, 


^  -  P.j(l-P.1}  ,  ab(b-l)  , 
n         n  2      CTL 

L 
Pj(l-p,1}  ,  fl   1 

_i 

L 


(5.7.1 


which  yields  the  same  result  as  Koop  C9].   Note  that  the 

presence  of  aT2   in  the  variance  expression  reflects  the  two- 

_i 
L 

stage  process  of  first  sampling  lines  from  the  mosaic,  then 
subsampling  points  from  each  transect.   Note  also  that  one 
needs  a  value  for  a£   even  though  point  sampling  is  employed. 
L 


5 .8   Systematic  Points  on  Random  Lines 

The  other  sampling  procedure  to  be  discussed  again 
begins  with  the  independent  location  of  b  random  lines 
inside  the  mosaic.   On  each  line  is  placed  a  systematic 
array  of  a  points,  again  resulting  in  n = ab  points  on  the 
plane.   Using  the  same  notation  as  in  Section  5.7, 


a    b 

1=1  m=l 


;ives  an  unbiased  estimate  of  p . ,  with 

J 


b 


I        I   varCu^)  +    J        I   cov(u      ,uJ    ) 
1  =  1    m  =  l  lm        i£s    m=l 


V(uJlm)    =   VmE(uJlm  |   line   m)  +EmV(u^  |  line   m) 


V    (P     .)  +E    [p     .  (1  -  p     .  )] 

m    e  mj  m   '  mj  "  mj 


■  pj(1-pj>' 


and 


cov(u.  ,u'-  ) 
im'  sm 


J   „J 


=  EmCOv(uim'Usm  I  llne  m)  + cov  E(uim'Usm  I  llne  m) 


=  Em[pm.i  (1  "  Pm.i  )1T(  I  *-*!*)+  c°v(p. jP.  )  ] 


mj   mj 


=    tt(|1  -  s|A)[p    (1  -p.)  -aT2    ]  +oT2 


Thus,  the  resulting  expression  for  V 


(5.8.1) 


will  look  like 


(5-3-2)  with  additional  terms  of  a2   to  account  for  the 

L 

two-stage  sampling  procedure. 


5- 9   Comparisons  with  Hilliard-Cahn 

Hilliard  and  Cahn ' s  work  with  one-dimensional  random 
point  counts  gives  an  expression  that  is  the  same  as 
(5-7-D •   Their  results  from  systematic  point  counts  are 
derived  under  very  rigid  restrictions,  one  being  that  a 
phase  ,j  feature  will  not  occupy  more  than  1  point  on  the 
line.   To  equate  this  with  the  conditions  described  earlier 
in  Section  5-3  would  necessitate 
(i)   cells  being  extremely  small  or  equivalently  n  being 
small  and  A  being  large  in  relation  to  L. 
(ii)   F.  being  very  small. 

Under  all  of  these  restrictions,  the  Hilliard-Cahn  variance 
expression  becomes 
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fN 


F. 

J 


which  would  yield  a  very  special  case  of  the  result  from 
(5.3.2). 


CHAPTER  6 
MULTISTAGE  SAMPLING 


6.1   Introduction 


As  mentioned  earlier,  it  is  possible  that  the  lineal 
and  point  sampling  procedures  discussed  in  Chapters  3,  k, 
and  5  may  have  resulted  from  a  subsampling  of  a  higher 
dimensional  sectioning  of  the  mosaic.   When  this  occurs, 
multiple  variance  terms  must  be  accounted  for  at  each  stage. 
The  use  of  a  lineal  transect  may  yield  estimates  for  struc- 
tural features  of  a  two-dimensional  mosaic,  but  that  planar 
mosaic  may,  in  fact,  be  a  cross-section  of  a  three-dimen- 
sional figure.   The  volume-fraction  of  the  void  in  a  piece 
of  copper  might  only  be  studied  from  looking  at  a  cross- 
section  of  the  specimen  under  a  microscope.   If  lineal  or 
point  sampling  is  used,  the  sample  fraction  due  to  the  void 
will  in  essence  be  estimating  the  areal  fraction  of  the  void 
on  the  planar  sample.   Thus,  any  results  that  would  be 
obtained  from  a  point  count  or  lineal  measurement  may  have 
to  account  for  one  or  more  additional  stages  of  sampling. 

6 .2   Lineal  Subsampling 

The  first  case  to  be  considered  involves  passing  a 
lineal  transect  through  a  planar  section  of  area  A,  which 
has  been  sampled  from  a  higher-ordered  mosaic.   With 
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notation  as  defined  in  Chapters  3  and  H,    let  L.  denote  the 
total  length  of  phase  j  segments  on  the  line.   Finding  the 
expectation  of  L./L  will  now  be  a  2-step  process,  first 
conditioning  on  the  planar  section,  G,  and  then  taking  the 
expectation  over  the  entire  mosaic.   As  proven  by  Miles  [15] 
in  (2.2.11), 


(L 


f  1° 


and  the  expectation  of  A. /A  over  the  entire  mosaic  will  be 

V./V  =  p. . 
J       J 

To  obtain  the  variance  of  L./L  as  both  L  and  A  go  to 
infinity,  the  two-stage  process  uses  the  asymptotic  variance 
approximations  from  Chapters  3  and  4.   One  has  that 

(L.  ,  )  „fl, 


V 


* 
=  VQE 


J-  G 


+  E„V 
G 


£\a 


* 

VG 


(A" 

J 
A  j 

+  EG 

T^io2   +V2)4- 


A.      A 


Ly 


1  - 


where  \i    and  o2    denote  respectively  the  mean  and  variance  of 
a  cell  intercept.   Therefore, 


=  aA.+LU(°2+^ 
J 


P,  -E' 


2,1,2      2 

=  %.  +L;7(a   +^ 
J 


PJ  "  PJ  "  % 
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22       T,    A 
Aj_   L.   Pjd-Pj) 

A     L 


(6.2.1) 


for  p.   5^  0,1,  where  a^.   and  afl   denote  the  one-stage  variance 

-1       _i 

L        A 
expressions  from  (3. 4.4)  and  (4.2.1)  respectively.   An  esti- 
mate for  o£   can  be  made  from  the  methods  described  in 

L 
Chapter  3-   However,  information  on  the  areal  section  will 

be  needed  to  gather  an  estimate  for  o2    . 

21 

A 

6. 3   Point  Subsampling 

If  areal  estimation  is  to  be  achieved  through  point 
sampling,  the  end  result  may  be  attributed  to  subsampling 
from  a  lineal  transect,  as  developed  by  Koop  [9]  and  men- 
tioned in  Chapter  5.   Regardless  of  whether  a  random  or 


systematic  point  cound  is  used,  N./n  as  defined  in  Chaptei 


lapter 


5  will  be  an  unbiased  estimator  for  A. /A.   For  a  two-stage 
systematic  point  count  on  a  line  £ ,  the  asymptotic  variance, 


-  V£E 


N 

n  I  line£l  +E£V' 


J-  u 


=  °L.  +EZ 
.A 
L 


L. 


1  +  -  I    (n  -m)TT(mA) 
m=l 


which  reduces  to 


a 


°L.+°N  " 

L    n 


L.  aN. 
JL     _i 

L   n 


A. 
J 
A 

A. 

(6.3.1) 


with  approximations  as  developed  earlier  in  Chapters  3  and 
5. 

If  the  systematic  point  process  is  generated  from  a 
lineal  transect  which  in  turn  has  been  taken  from  an  areal 
section  of  the  mosaic,  then  a  three-stage  sampling  has 
occurred,  and  will  involve  variance  components  from  all 
three  stages.   Again,  N./n  is  unbiased  for  p.,  with 


=  Var  E 

G,  £ 


-*■   line  £ 


fN 


+  E  V 
G.£ 


=  V 
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(L. 
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+  E 


G,£| 


L. 
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L. 


1  +  -  I    ( n  -  m )  n  (mA ' 
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°A.  °L. 
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L     A 
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n    2 

n   m=  1 


I    (n  -  m)ir(mA) 


°A.  °L 
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'A.+0L.+°N. 

_£      _jl      .J.    *  J 

A     L     n 


P,(l  -P,) 


A .  L .    A.N.    L.N. 
A   L    An    L  n 


°N.°L.aA 


n   L 


CPjCl-Pj)]" 


(6.5.2 


Note  that  the  three-stage  sampling  procedure  incorporates 
the  asymptotic  variances  from  all  three  stages  in  a  recog- 
nizable pattern,  parallel  to  the  results  achieved  by 
Anderssen  [ 1 J . 


CHAPTER  7 
NUMERICAL  RESULTS 

7 . 1   Introduction 

The  variance  approximations  suggested  in  the  past  four 
chapters  have  been  individually  developed  under  specific 
assumptions  on  the  mosaic.   This  chapter  will  numerically 
compare  some  of  the  approximations  developed  from  Chapter  3, 
so  that  some  practical  guidelines  can  be  given  on  the  vari- 
ance expression  to  be  used  for  estimates  obtained  from  lin- 
eal sampling  on  the  mosaic.   Four  mosaics  were  each  sampled 
by  placing  a  grid  of  lines  of  fixed  length  within  the 
boundaries  of  the  two-dimensional  pictures.   Table  I  gives 
the  results  of  the  lineal  sampling  of  a  random  lines  mosaic 
(Poisson  Lines  Model)  given  by  Pielou  [16].   The  mosaic 
studied  by  Table  II  is  a  simple  case  from  Matern  [12]  of  a 
realization  of  the  Boolean  Scheme,  with  each  6.  set  being; 
a  circle  of  fixed  radius.   The  results  of  Table  III  come 
from  an  ad  hoc  picture  consisting  of  many  small  features, 
which  are  similar  in  size  and  shape,  and  fairly  regularly 
spaced.   Tables  I  -III  are  used  to  compare  the  variance 
expressions  of  the  area 1-f raction  estimators.   Table  TV 
gives  variance  approximations  for  data  gathered  to  esti- 
mate the  boundary  length  of  Lake  Alice,  on  the  University 
of  Florida  campus. 
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The  areal-fraction  variance  terms  compared  in  Tables 
I  -  III  are  the  v..  terms  under: 

(1)  The  assumption  of  the  Poisson  Lines  Model,  where  v* 
is  given  by  (3.11.1). 

(2)  The  assumption  of  the  Boolean  Scheme,  which  gives  v 

Jj 
by  (3.15.4). 

(3)  The  technique  developed  by  Milliard  and  Cahn  [8], 
which  utilizes  the  individual  segment  information  by 
(3.10.1). 

7.2   Results:   Table  I 

The  results  of  Table  I  give  very  reasonable  approxi- 
mations from  the  expression  (3-11.1),  in  comparison  with 

s^   =  .0*126, 
L 

which  is  the  sample  variance  of  the  individual  areal- 
fraction  estimates  from  the  L./L  column.   The  only  times 
for  which  the  approximation  seemed  to  differ  greatly  was 
for  lines  3  and  H,    for  which  L./L  was  small,  resulting  in 
the  numerator  of  v    being  small. 

The  Hilliard-Cahn  approximations  from  these  lines 
tended  to  underestimate  the  lineal-fraction  variance.   This 
was  not  a  surprising  result,  in  that  the  model,  obtained 
from  Pielou  [16],  resulted  in  cell  sizes  that  were  fairly 
large  in  comparison  with  the  length  of  the  sampling  tran- 
sect.  This  allowed  for  very  few  segments  from  each  phase, 
and  accounted  for  minimal  variability  from  segment  to 


segment,  which  results  in  a  small  ratio  of  the  s?  terms  to 

-2 
the  y.  terms.   Note  that  in  line  7,  only  one  phase  2  seg- 
ment was  on  the  transect,  so  that  an  approximation  by  this 
approach  was  not  possible. 

In  looking  at  the  results  over  all  ten  lines,  it  was 
suggested  that  a  wider  spacing  of  the  lines  might  produce 
a  more  "random"  sampling  effect,  so  the  second  section  of 
Table  I  was  added.   This  resulted  in  approximations  that 
were  very  close  to  the  sample  variance  terms  of  the  lineal- 
fraction  estimates. 

7.3   Results:   Table  II 

Originally,  the  mosaic  yielding  the  results  of  Table 

II  was  sampled  in  order  to  test  the  variance  approximations 

from  a  mosaic  arising  from  a  Boolean  Scheme.   Unfortunately, 

the  picture  taken  from  Ma tern  [12]  again  had  large  6.  sets 

J 

in  comparison  with  the  fixed  length  of  the  sampling  tran- 
sect.  This  resulted  in  individual  variance  approximations 
using  (3.15-4)  being  very  close  to  zero,  and  sometimes  neg- 
ative.  In  looking  at  (3.15.4), 


w 


("r)   2r2M 


JJ 


The  first  term  of  this  expression  yielded  values  that  were 

2 
close  to  Sj  =.0301,  but  the  second  term  tended  to  subtract 

L 

off  too  much,  resulting  in  the  variance  approximation  being 

too  small.   This  might  be  accounted  for  from  only  generating 
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the  first  two  terms  in  the  Taylor  expansion  for  eTK(h),  in 
obtaining  (3.15.4).   The  third  term  might  be  able  to  com- 
pensate for  the  amount  subtracted  by  the  second  term. 

This  model  did  allow  for  comparisons  between  the 
Hilliard-Cahn  and  Poisson  Lines  approximations  for  the  var- 
iance of  I^/L.   Again,  the  Hilliard-Cahn  approximations 
tended  to  be  smaller  than  the  sample  variance,  which  can  be 
attributed  to  the  regularity  of  the  3 .  sets  (circles  of 
same  radius),  thus  allowing  for  a  minimal  amount  of  vari- 
ability from  segment  to  segment. 

The  approximations  under  the  Poisson  Lines  Model  were 
much  more  accurate  than  the  Hilliard-Cahn  results,  although 
in  general,  they  also  tended  to  underestimate  s2   =  .0301. 


It  was  interesting  to  see  how  the  assumption  of  an  incor- 
rect model  would  affect  the  variance  approximations  of  the 
lineal-fraction  estimates.   For  this  case,  the  approxima- 
tions derived  from  the  Poisson  Lines  Model  used  on  a 
Boolean  Scheme  mosaic  were  still  fairly  good. 

Again,  by  looking  at  a  wider  spacing  of  the  lines,  it 
was  hoped  that  the  estimate  from  the  assumption  of  the 
Boolean  Scheme  might  give  some  accurate  variance  approxi- 
mations, but  again  the  second  term  of  (3.15.4)  tended  to 
subtract  too  much  from  the  variance  expression.   How- 
ever, the  Poisson  Lines  approximations  seemed  to  be  reason- 
ably accurate  with  the  wider  line  spacing. 
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7.4   Result; 


Table  III 


After  studying  the  results  from  Tables  I  and  II,  it 
was  decided  to  try  to  find  a  model  for  which  the  Hilliard- 
Cahn  technique  might  be  appropriate.   The  mosaic  studied 
by  Table  III  had  many  small  features  with  relatively  little 
variation,  so  that  each  sampling  transect  studied  had  four 
phase  one  segments  and  five  phase  two  segments.   With  this 
much  information  to  utilize,  the  Hilliard-Cahn  technique 
gave  reasonable  approximations  to  the  sample  variance. 
Five  additional  lines  were  included  in  the  sampling  grid  to 
check  for  variability  of  the  areal  fraction  estimates,  and 
the  sample  variance  from  the  10  line  grid  made  the  Hilliard- 
Cahn  results  look  even  more  accurate  than  ever  before,  and 
certainly  much  improved  over  the  results  from  Tables  I 
and  II. 

Surprisingly  enough,  again  under  false  modeling  assump- 
tions, the  foisson  Lines  variance  approximations  seemed  to 
be  fairly  close  to  the  ten-line  sample  variance.   However, 
this  was  the  first  time  in  the  three  models  studied  that 
these  approximations  tended  to  generally  exceed  the  sample 
variance . 


7-5   Result; 


Table  IV 


The  sampling  from  the  University  of  Florida  map  was 
originally  done  for  the  purpose  of  estimating  the  boundary 
length  of  Lake  Alice.   Recall  from  (2. 2.6.)    that  the  tt/2 
times  the  number  of  intersections  per  unit  length  the 
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sampling  transect  makes  with  the  boundary  gives  an  unbiased 
estimate  of  boundary  length  per  unit  area.   Modifying  the 
variance  term  of  (3.8.7), 


C^pJ  +  (1-Pir}  +cJ|H— -2y 


(2) 


(7.5.1 


Estimating   C^    by    N-L/2L   and    p      by    L   /L   gives 

2 


2N- 


+     1  - 


(7-5.2 


assuming  the  C   term  of  (7-5.1)  will  be  very  small.   From 
the  four  line  grid  used  on  Lake  Alice,  the  variance  terms 
tended  to  exceed  the  sample  variance  of  the  N , /L  values, 
but  when  a  grid  of  15  lines  was  used  on  the  mosaic,  the 
sample  variance  of  l^/L  was  bigger  than  any  of  the  indi- 
vidual variance  approximations,  implying  that  this  vari- 
ance approximation  appears  to  be  reasonable  for  boundary 
length  estimation. 

7 . 6   Conclusions 


While  no  major  results  could  be  concluded  from  a  study 
of  three  theoretical  mosaics,  several  observations  can  be 
made  from  the  results  of  the  four  tables.   For  any  vari- 
ance approximation  to  yield  accurate  results,  the  features 
should  be  small  in  relation  to  the  overall  size  of  the 
mosaic,  which  could  allow  for  a  relatively  long  sampling 
transect.   The  results  from  the  Hilliard-Cahn  technique 
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appear  to  be  more  and  more  accurate  when  the  individual 
features  are  widely  varied,  and  the  transect  intersects 
many  features.   The  Poisson  Lines  approximations  appear 
to  be  reasonable  for  a  random  arrangement  of  features, 
which  can  be  tested  from  the  method  described  in  Section 
3-13.   However,  the  Poisson  Lines  approximations  also  gave 
close  results  under  the  false  modeling  assumptions,  so 
that  this  method  might  be  used  for  situations  in  which 
sampling  information  is  not  easily  obtainable. 

Unfortunately,  the  Boolean  Scheme  only  gave  crude 
approximations,  for  reasons  explained  in  Section  7.3,  but 

perhaps  for  models  with  very  small,  but  fairly  congruent 
features  randomly  situated  about  the  mosaic,  this  method 

might  yield  significant  results,  even  if  only  the  first 

term  of  (3.15.4)  is  used. 

The  variance  approximation  from  Section  7.5,  used  in 

boundary  length  estimation  seemed  to  be  reasonably  close 

to  the  sample  variance,  and  may  yield  even  better  results 

for  irregularly  shaped  features  that  may  not  necessarily 

be  convex. 
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Table  I 

Poisson 

Hilliard-Cahn 

Line 

L1/L 
•  316 

L?/L 
.684 

2 

M 

2 

Vll 

Vll 

1 

.04  67 

.0274 

2 

•  337 

.663 

2 

2 

.  0499 

.0112 

3 

.168 

.832 

2 

3 

.0156 

.0163 

4 

.200 

.800 

2 

3 

.  0205 

.0895 

5 

•  3  37 

.663 

2 

3 

.0399 

•  0139 

6 

.474 

.526 

2 

2 

.0622 

.0237 

7 

•  737 

.263 

2 

1 

.0501 



8 

.716 

.284 

2 

2 

.0413 

•  0393 

9 

.663 

•  3  37 

3 

2 

.0399 

.0208 

10 

.453 

.547 

3 

3 

.0491 

.0593 

Averages 

.  4401 

•  5599 

2.2 

2.3 

Over  all 

10 

lines: 

Poisson: 

v(p3) 

=  . 

0552 

p 

SL. 

.0426 

Hilliard- 

Cahn: 

v(p2) 

= 

02  64 

J 
L 

Poisson 
Lines         P\         s|        V(p1) 


1,3,5,7,9  .4442  .0595  .0544 

2,4,6,8,10  .4360  .0364  .0550 

1,4,7,10  .4265  .0536  .0563 

2.5.8  .4633  .0480  .0571 

3.6.9  .4350  .0624  .0519 
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Table    II 


Boolean 

Scheme 

Milliard 

flst  term] 

P 

oisson 

Cahn 

I   only   J 

Line 

L7L 

L„/L 

Mn 

M^ 

~  * 

~# 

~  * 

1 

.191 
.124 

2 

.809 

.876 

1 

2 
2 

2 

3 
2 

11 

vll 

Vll 

1 
2 

.0191 
.  0118 

.0122 
.0098 

•  0355 
.0248 

3 

.51^ 

.486 

5 

5 

.0250 

.0164 

.0571 

4 

.629 

.371 

5 

5 

.0218 

.0107 

•  0533 

5 

.514 

.486 

5 

5 

.0250 

.0218 

.0571 

6 

.448 

•  552 

5 

5 

.0245 

.0082 

.0565 

7 

.  314 

.686 

4 

3 

.0265 

.0102 

.0492 

8 

.276 

.724 

3 

4 

.0228 

.0062 

.0457 

9 

.219 

.781 

2 

3 

.0234 

.2000 

.0391 

Averages 

.3588 

.6412 

3.67 

3-89 

Over  all 

9  lines: 

Poisson : 

V( 

PX)  = 

.0280 

sL   =  .0301 
J* 

Hilliard-Cahn: 

V( 

PX)  = 

.0216 

L 

Boolean 

Scheme : 

v( 

PT)  = 

.0098 

Boolean 

Scheme 

(1st  term 

only ) : 

v( 

p:)  - 

.0526 

Poi  sson 

Boolean 
Scheme 

Boolean 

Scheme 

[1st  term] 

(   only   J 

Lines 

P 

•  3504 

2 
s~ 
_P 

.0244 

v(px) 

^(pV 

vtp^) 

1,2,5,7,9 

.0280 

.0102 

.0520 

2,4,6,8 

.3693 

.0475 

.0280 

.0091 

.0532 

1,4,7 

•  3780 

.0510 

.0301 

.0123 

.0537 

2,5,8 

•  3047 

.0387 

.0257 

.0088 

.0484 

3,6,9 

■  3937 

.0240 

.0274 

.0067 

.0546 

Table  III 
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Line 

1 
2 

3 
4 
5 

Averages 


Lx/L 


L2/L 


Poisson 

Vll 


•  3353 

.6647 

4 

5 

.0221 

.1824 

.8176 

4 

5 

.  0099 

.2235 

.7765 

4 

5 

.0134 

.  2294 

■  7706 

4 

5 

.0139 

.2647 

•  7353 

4 

5 

.0168 

.2471 

.7529 

4 

5 

Hilliard-Cahn 

A     * 
Vll 

.0125 
.0089 
■0135 

.0047 
.0060 


Over  all  5  lines: 


Poisson:      V(p, )  =  . 0154 


s    =  .0032 

1 


Hilliard-Cahn 


vCf^)  =  .0093 


Over  10  lines 


sT   =  .0.10! 
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Table  IV 


Line 

•  395 

2 

V(N1/L) 

1 

.000318 

2 

.2  35 

2 

.000390 

3 

.568 

4 

. 000621 

4 

.469 

4 

.000612 

Average 


4168 


Over  all  4  lines: 


VCN-j/L)  =  .000470 


000203 


From  a  15  line  grid 


OOIO85 
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